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0. In a recent paper in Language con- 
' cerned primarily with the statistical descrip- 
' tion and the internal ation of 
Indo-European, as this subject was recently 
reopened (also in Language) by Ellegird, 
I was stimulated by the latter’s precedent 
to add a final section on taxonomy in 
| language study. Not only did I agree with 
Ellegard that the total field of linguistics 
needed more systematic classification, but 
I supported the opinion that we were nearing 
the point of recognizing that there may 
have been on the average more external 
influencing in the history of human lan- 
guages (diffusion, ‘wave theory’) than 
internal genetic development traceable by 
the standard comparative philological 
-method—much as in the _ non-linguistic 
‘domains of culture. If this is so, no amount 
of reconstructive phylogeny will give us 
more than partial and isolated pieces of the 
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total history of human language. With the 
genetic development of any group of related 
idioms pursued farther and farther back in 
time, the evidence becomes more ambiguous 
and the reconstruction more speculative, 
until the far end is completely open: the 
language just has no bottom any longer. So 
far as we can tell, it is conceivable that all 


- languages now spoken or written on earth 


go back to a single common origin. But 
even if so, it is clear that beyond a point 
which we can already foresee, although we 
cannot precisely define its time distance, 
the reconstructed data will be so vague and 
chaotic that we can never hope to tie them 
together with assurance. The reconstructed 
genetic groups will have reduced to narrower 
and narrower cones and finally mere lines 
of descent, which, even if they head toward 
a seeming point of meeting, occupy so little 
of the total area of speech then existing, 
that probability will be only infinitesimally 
small that the vanishing point or points 
will be near any historic actuality. 

In short, the very virtue of the compara- 
tive genetic method, the fact that it deals 
with the strict evidences of point-for-point 
homologies, also sets a necessary limitation 
to its scope. What falls outside the cone of 
its homologies, even though historically it 
lies within the genetic bundle of idioms 
investigated, is not reached by the method; 
while everything else beyond the bundle is 
of course precluded by definition. 

It is thus evident that the main area of 
linguistics, especially so far as this is con- 
cerned with the total multiplicity of lan- 
guages, will have to be tilled by non-genetic 
methods. That means non-genetic compari- 
son. At any rate, such comparison, on a 








172 


systematic basis, is a first step and a neces- 
sary one. It will yield indications of similar- 
ity and relationship, whether these be due 
to internal development or to influence 
from without; and these indications may 
suggest what may ultimately be proven to 
be genetic and what not. 

When we reflect on the fact that hundreds 
of different languages now possess modern, 
competent descriptive grammars, it is 
astonishing how little comparative use has 
been made of them. Either we lack interest 
in comparing non-genetic groups, or we 
lack knowledge of how to compare them 
fruitfully. 

Of course comparison is not the only 
problem in non-genetic linguistics. The 
purpose of comparison is to establish first 
the range of variations and similarities, and 
then to give historic meaning to the syn- 
chronic variations and similarities. There is 
also a synchronic approach which is without 
ultimate diachronic motivation, but looks 
for constant principles. This approach has 
given us, for instance, the concepts of the 
phoneme and the morpheme. It goes without 
saying that both interests are legitimate, 
but they must follow different methods. In 
the far run, they tend to interact helpfully. 
For instance, comparisons today can have 
more meaning than they would have had 
before the phoneme and morpheme concepts 
were formulated. 

But cross-genetic comparison is out of 
fashion. Perhaps the presence of the estab- 
lished method of dealing with genetic 
homologies has helped block its develop- 
ment. I believe that the recent growth of 
linguistics has been great enough that we 
are losing opportunities by hanging back 
from undertaking serious cross-genetic com- 
parison, especially now that the limits of 
productivity of the genetic method are 
becoming more evident. It is obvious that 
we shall never attain to a generic or total 
taxonomy of language without cross-genetic 
comparison. 

In my paper in Language I suggested that 
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Greenberg’s return to a typological approach 
(in the Wallis Festschrift, see Spencer in 
the Bibliography) was a promising start, 
though I also deplored Greenberg’s seeming 
lack of confidence or interest in following 
up his tentative endeavor. The cld deductive 
and ethnocentric typology of languages 
that began with the Schlegels and W. (not 
A.) von Humboldt had come to a dead end 
with Schleicher. Fifty years later Sapir 
made a gallant attempt to evolve a sounder 
and richer typology, and certainly succeeded, 
Yet in spite of occasional interest by anthro- 
pologists and others, the linguistic profession 
has essentially ignored Sapir’s classification. 
Perhaps this was why he himself never 
returned seriously to it. 
Now, thirty years after Sapir, Greenberg 
has picked up where Sapir left off, using 
Sapir’s ideas—with one important difference. 
Sapir still classified languages as unit 
wholes, according to the degree in which 
certain structural categories were manifest 
in them. Greenberg’s originality lies in 
comparing the features or categories instead 
of the whole languages. This has enabled 
him to develop a scale for measuring or 
expressing the degree ‘of development of 
each feature by a numerical index. The 
composite typological index is thus an 
assemblage of indices for ten features. It 
makes possible the expression of more nu- 
merous combinations of features, and repre- 


sents a definite gain in analytic penetration |’ 


and flexible description, over what Sapir 
could attain with his typing of language 
wholes. Far improved over his predecessors’ 
classes as Sapir’s classes of languages were, 
his synthetic intuitive results were some- 
times awkward compromises. For instance, 
in his Fundamental Type B, ‘Complex 
Pure Relational,” particular languages are 
cited as being Agglutinative, Fusional, 
Symbolic, Agglutinative-isolating, Fusional- 
isolating, Fusional-agglutinative in ‘tech- 
nique,’ each with or without symbolic 
tinge; and further, the same languages are 
cross-classified as being also either analytic, 
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nildly synthetic, synthetic, mildly poly- 
ynthetic, polysynthetic. 

Greenberg’s typological indexing contains, 
in dealing with features on a quantitative 
sale for each, a novel point of view and a 
procedure of broad import. It can probably 
be extended to include additional features 
w categories which Sapir did not utilize in 
bis 1921 classification and which therefore 
Greenberg omitted from his total index. I 
hope to follow with a paper making some 
siggestions as to such broadening of scope. 
Meanwhile, however, it seems worth while 
io demonstrate that Greenberg’s illustration 
a his index, by application of it to eight 
ample languages, reveals even more results 
than he claims for it. Possibly with his 
quick wits and nimble faculty of generaliza- 
tion, Greenberg has sensed the significances 
which I am about to make explicit, assuming 
am audience which would automatically 
reognize them. Yet the lack of general 
positive recognition accorded his paper in 
five years suggests that linguists have 
construed his casual manner, brevity of 
substantial evidence, and lack of follow-up, 
as evidence of a degree of tentative half- 
heartedness on his own part. I do not share 
this feeling: the paragraphs that follow aim 
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to show that Greenberg has started some- 
thing of great possibilities. 


1. I reproduce first, as Table 1, Green- 
berg’s tabulation of ten index ratings of 
eight languages, so that my discussion of 
them can be more readily followed. It may 
also be of general convenience to have 
Greenberg’s basic tabulation available in a 
linguistic journal. His paper is a lone linguis- 
tic one in a Festschrift that appears to have 
had no wide circulation. I use his names for 
the 10 indices but have renumbered them 
1, 2, 3a, 3b, 3c, 4a, 4b, 5a, 5b, 5c instead of 
i-10 to indicate their grouping. I have also 
added the formulaic ratio for computing 
each index, plus explanation of the abbrevia- 
tions used in the formulas. 


2. Rank orders within each index are 
substituted in Table 2 for Greenberg’s 
Table 1 values. For instance, under the 
first index, Synthesis, the Eskimo value of 
3.72 is highest, and therefore its entry is 1; 
Sanskrit is second with 2.59, Swahili a 
close third with 2.55, and thus down to 
Annamese with 1.06, the last, or rank 
number 8. And so on for all ten indices. For 
convenience, all firsts are blackface, and 


TABLE 1 
Greenberg’s Typological Index Values 





| | | 




















Sk | AS | Per | Eng | Yak | Swa | Ann | Esk 
1. Synthesis | m/w | 2.50 | 2.12 | 1.52 | 1.68 | 2.17 | 2.55 | 1.06 | 3.72 
2 Agglutination | A/J | .09 | .11 | 34 | .30 | .51 | .67 | .. | .03 
3a. Composition | BPM Meh idee | I 1 ee) BR Bee Le] dee 
3b. Derivation | D/W 62 20 | .10 15 35 | 07 | — | 1.25 
%. Gross Inflection | I/W | .84 | .90 | 39 | .68 |} .82 | .80 | — | 1.75 
‘a, Prefixation | pew} 16 | .o6 | 1 | of} — | 1.16] — | — 
tb. Suffixation oiimos ce. See See Cee ee Cee: 
da. Isolation | O/N .16 15 62 | .75 .29 | .40 | 1.00 .02 
5b. Pure Inflection | Pi/N .46 47 29 | .14 Laid RP orahRe) uct 46 
je. Concord | C/N .38 | .38 19 | te ee: a rt. .38 





Concord. 


M Morpheme; W Word; A Agglutination, lack of morphophonemic change at morpheme juncture; J 
morpheme juncture; R Root morpheme; D Derivational morpheme; I Inflectional morpheme; P Pre- 
fixation; S Suffixation; O word Order; N Nexus, expression of word interrelations; Pi Pure inflection; C 


Based on Greenberg, p. 218, in R. F. Spencer, ed., 1954. 
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TABLE 2 
Rank Orders of Languages Substituted for Index Values 
| Sk ss | te Eng Yak Swa Ann Esk 
1. Synthesis | M/W 2 5 7 6 4 3 8 1 
2. Agglutination | A/J 6 = 3 diy 1 i 7 
3a. Composition | R/W 1 3 5-8 4 5-8 5-8 
3b. Derivation | D/W 2 “ | 6 5 3 7 8 1 
3c. Gross Inflection | I/W 3 2 | 4 6 | 4 5 8 z 
4a. Prefixation le eh 2 3 | 5 4 | 6-8 | 1 6-8 6-8 
4b. Suffixation | S/W | 2 | 4 | 6 5 3 4 7 8 | 
5a. Isolation DO TS Hf] ek ie Bk. ris # 1 8 
5b. Pure Inflection | Pi/N | 3-4 | 2 | te 1 6 8 3-4 
5c. Concord C/N | 24 2-4 | 5 | 7 6 1 8 2-4 
TABLE 3 each. Yakut is tied once for last, Sanskrit’: 
Frequencies of Certain Rank Orders per lowest rank is 6. 

Language 3-4 3.2. The three Indo-European language 
| | Lowest Index Other than Sanskrit do not once - plac 
| .~ | arent a first, nor once definitely last, though two 
| Absol. | Tied of them are tied once for last and each 

i | | | | sinks once or twice to next to lowest place, 
Eskimo | 4 Se OL here Lado-B : id | 

Swahili Bantu | 3 ze p=} n o-Huropean 1S eVl ently a group not 
idiots Ce Bec coe aa ee given to extremes. This applies even to 
Sanskr | | | pp 

Yakut Turkish Bat Pixed — {1 Sanskrit, much the oldest of the four Indo- 
Anglo-Saxon Wy 2 er | 1 European languages dealt with, for apart 
English ; wirran & | til from its single instance of rank 1 (due t 
Mod. Persian — | => re i ene" f d Senshi 
lalla pent @ | goby its fondness for compound nouns), Sanskrit 





* 9 indices only. 


all eighths italicized; 3-4, 5-8, mean there 
is a double or quadruple tie for those rank 
positions. 


3. A frequency selection of the probably 
most significant ranks, the highest and 
lowest, from Table 2 is given in Table 3 in 
a rearranged order. It is at once evident 
that marked differences characterize the 
languages. 

3.1. Eskimo has highest rank 4 times, is 
lowest or tied for lowest 3 times in the ten 
indices. It is a language of extremes, evi- 
dently, from a generic point of view. 

Next comes Bantu Swahili, with 3 firsts, 
but only 1 tie for last. 

Still giving prime consideration to firsts, 
we find Yakut and Sanskrit with 1 first 


ranges wholly between ranks 2 and 6. h 
spite of the sample being small, it suggests 
that Indo-European may prove to be a 
essentially middle-of-the-road or average 
type of human language, instead of being 
the most advanced or perfect as used to b 
thought. 

3.3. Annamese, the only representative 
in the sample of the long recognized ‘isolat- 
ing’ class, comes out, expectably, as bein} 
extreme at the opposite end from Eskimo 
It achieves 1 first (in ‘Isolation’), 1 secon 





(in ‘Composition’), and is last or tied fa 
last in its remaining 7 features. 

3.4. The upshot is that the Greenber 
index-scoring method provisionally validate 
two of the old classes of languages, but 
fails to support two others. Isolating ani 
Polysynthetic remain as actual classed 
Agglutinating and Inflectional do not. | 
was in regard to these two latter that ethn 
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centric rating was most successfully active 
in nineteenth-century typing. Indo-Euro- 
pean languages were rather remote from 
obtrusively manifesting either isolating or 
polysynthetic qualities, so there was less 
judgment-warping in regard to these two 
features. To be sure, stem-compounding is 
undoubtedly a form of synthesis, but the 
Sanskrit and classic Greek synthetic propen- 
sities did not happen to include verb-verb 
or verb-object compounds nor certain 
other forms of synthesis, so that it was 
easy to keep Indo-European out of the 
polysynthetic ‘contamination.’ 

3.5. It is however clear that Sanskrit, 
the oldest of the four Indo-European 
languages subjected to test by Greenberg, 


has a high overall synthetic propensity, © 


aside from the binary compound nouns in 
which it ranks number 1. It stands second 
in rank in no less than four other traits 
that contribute toward generic synthetic 
quality, namely, Derivation, Prefixation, 
Suffixation, and general Synthesis (ratio of 
morphemes to words). 

3.6. This conclusion is confirmed if we 
summate the values of Table 1. We find 
the following seriated results: Eskimo, 
11.33, Sanskrit 7.51, Swahili 7.06, Yakut 
7.02, Anglo-Saxon 6.42, English 5.34, 
Modern Persian 4.88, Annamese 3.13. Here 
Sanskrit comes between Eskimo on one side 
and Swahili and Yakut on the other, and 
is well separated from the other three 
Indo-European languages. To be sure, 
summation is only a rough method of 
seriating classification, especially when the 
values of the several indices vary greatly in 
absolute size—on which point I hope to 
say more at some later time; nevertheless it 
probably has some indicative significance. 


4. As to the change of structural type 
that has occurred within Indo-European, 
the results contained in Table 1 come out 
more clearly if we isolate the Indo-European 


' index values and then arrange them in 
| Some rank order, preferably perhaps the 
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TABLE 4 


Indo-European Rank Orders in the Sequence 
of High Ranks in Sanskrit 


























5e. Concord 2- 
5b. Pure Inflection 3 
2. * Agglutination 
5a. * Isolation 
Order) 
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Index Sk A-S | Pers | Engl 

3a. Composition 1 | 5-8} 3 158 
4a. Prefixation 2; 3 5 4 
4b. Suffixation 2 4 6 5 
3b. Derivation 2 4 6 5 
1. Synthesis 2 5 7 6 
3c. Gross Inflection 3 2 7 6 
24) 5 7 

5 7 

3 4 

3 2 
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order of strength which the ten traits show 
in Sanskrit, the earliest. Table 4 gives the 
results. While Anglo-Saxon is of course 
ancestral to English, it is so by somewhat 
less than 1500 years, whereas its time falls 
about equally long after Sanskrit. It will be 
seen that in conformity with its midway 
temporal position, Anglo-Saxon averages 
about equally intermediate between Sanskrit 
and the two modern Indo-European lan- 
guages in its features. 

4.1. The change in structure from early 
to late Indo-European is not only consistent 
but rather drastic. In eight out of ten 
features, the rank of Sanskrit is in the 
first half (ranks 1-4) of all the languages 
considered: these are the first eight indices 
in Table 4. In six of these eight, Anglo- 
Saxon also ranks in the upper half, though 
usually lower (five times lower, once the 
same, twice higher) than Sanskrit. But as 
regards these same eight features, both 
Persian and English seven times out of 
eight fall in the lower half (ranks 5 to 8). 

4.2. In the two remaining features, the 
last in the table and marked by asterisks, 
the ranking of Sanskrit is 6, of Anglo-Saxon 
5 and 7, but Persian and English are in the 
UPPER half, Persian in rank 3 and 3, English 
in 4 and 2. This exceptional direction of 
change is indicated by the musical sign for 
crescendo (higher index, indicated by rank 
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order nearer 1) placed between the columns 
for ancient and for modern Indo-European. 

Similarly, the eight majority traits are 
divisible into two subgroups, the first five 
and the next three in Table 4. The first 
five are those most immediately concerned 
with complexly synthetic words, and in all 
of these Sanskrit ranks higher than Anglo- 
Saxon as well as the two contemporary 
languages. In the next three traits, however, 
(3c, 5c, 5b), Anglo-Saxon ranks a bit higher 
than Sanskrit, or is tied with it. These 
three traits are Gross Inflection, Pure 
Inflection, and Concord. These have asso- 
ciations with word synthesis but are not 
directly expressive of it. This statement 
might be logically challenged for Gross 
Inflection (I/W); but it holds true at least 
quantitatively in that inflectional mor- 
phemes on the whole constitute lighter or 
smaller increments (containing fewer pho- 
nemes) than derivational or root morphemes 
or prefixes and suffixes (4a, 4b) as positional 
overall classes of morphemes. 

4.3. On the contrary, the two traits 
which have been accentuated in Indo- 
European with the passage of time, as 
indicated by higher rank order, are Isolation 
and Agglutination. Isolation refers to fre- 
quency of syntactical nexus being achieved 
through the orpER of isolated root words. 
Agglutination is of course not a form of 
isolation because it refers to a particular 
kind of combination of morphemes within 
words. But it indicates combination of the 
loosest kind, without (or with minimal) 
morphophonemic change, so that the ele- 
ments are merely glued together in juxtaposi- 
tion instead of being welded by morpho- 
phonemic process, and therefore are readier 
to be shaken apart into isolated units. It is 
thus clear that Table 4 adequately expresses 
the tendency, long intuitively recognized in 
Indo-European, and sometimes expressed, 
for the later phases of these languages to 
approach the isolating, analytical character 
of the East Asian languages, and of Polyne- 
sian within Malayo-Polynesian. 
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5. This previous recognition raises the 
question of why one should bother at all 
with numerical expression if the phenomena 
can be recognized by intuitional thought- 
processes, that is, verbal and non-numerical 
thinking. 

The answer is that when broad problems 
of comparison of multiple phenomena are 
undertaken, answers that are both broad 
and specific can ordinarily be given only as 
based on some sort of quantitative evalua- 
tion. The unaided intuitive mind can carry 
and balance simultaneously only so and so 
many variable phenomena, as is necessary 
in order to ascertain their even more numer- 
ous interrelations which underlie the organ- 
ized classification and then interpretation 
of the phenomena. The intuitive mind: of a 
conscientious scholar realizes its weakness 
in such larger situations and refrains from 
all but the more overwhelmingly patent 
inductions—from fear of making mistaken 
judgments where the skein becomes more 
tangled. The less conscientious investigator 
falls back on guess work or personal bias or 
ethnocentrism, and gives answers that 
contain errors which obscure the situation 
for later workers. It is the really devoted 
scholar, unwilling to err and best able to 
judge, who is most likely to hang back and 
refrain from broad judgments, particularly 
in the face of comparative problems. What 
a set of indices like Greenberg’s does is to 
enable the conscientious scholar to present, 
by simple counting, arrangement, and 
arithmetical division, unbiased answers 
some degree of probability as to a larger 
number of interrelations. 

It is not in the least a question of pedantry 
through wearing the false plumes of physical 
science. There is no algebraic equation used 
for these indices, not even of the simplest 
kind—nothing but elementary arithmetic a 
learned in grammar school. 

The intuitive scholarly mind remains in 
full control of all critical areas: the setting 
of the problem, the rigging of means, the 
interpretation of results. 
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6. I think the foregoing analysis has 
shown that definitely meaningful results can 
be got from numerical values for ten indices 
as applied to only eight languages. If the 
eight languages were raised to twenty-eight 
or thirty-eight, including say Greek, Latin, 
Russian, Arabic, Hebrew, a Hamitic, 
Dravidian, Indonesian, and Ugric language 
or two, Japanese, Polynesian, a Melanesian 
dialect; from native America say a Uto- 
Aztecan, Mayan, Siouan, Athabascan, Al- 
gonkin, and three or four South American 
representatives, and with several from 
Africa—what illuminating likenesses and 
unlikeness, within genetic groups and across 
them, might not be revealed? And if indices 
could be devised for twenty instead of ten 
traits, the qualitative depth and saturation 
of findings might be doubled or even more 
than doubled. 

As said above, I am planning to return 
ina future paper to the problem of enlarging 
the range of Greenberg’s typological ap- 
proach. I hope I have shown here its 
potential value, even as it stands, for 


ON TYPOLOGICAL INDICES I: RANKING OF LANGUAGES 


177 


contrastive and constructive classification 
of languages. 
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A QUANTITATIVE APPROACH TO 


THE MORPHOLOGICAL TYPOLOGY 


OF LANGUAGE* 


JosepH H. 


CoLUMBIA 


One of the steps which any science must 
take if it is to realize the potentialities of 
the scientific method is to advance beyond 
mere description to comparison and classifi- 
cation of the objects it studies. That linguis- 
tics has taken this step is indicated by the 
very existence of a subject matter called 
“comparative linguistics,” one which is, 
moreover, of respected standing among the 
sciences dealing with man. However, the 
methods of comparative linguistics represent 
but one of two fundamental methods by 
which languages may be compared. The 
second method, which may be called the 
typological, is the subject of the present 
paper. It has had a more checkered career 
than the historic-genetic method which 
characterizes comparative linguistics. It is 
important to distinguish as clearly as 
possible the differences between the two 
methods. Each is legitimate in its own 
sphere, but a confusion between the two 
methods, as when typological criteria are 
employed to establish genetic relationships, 
has had harmful effects in the past. 

The genetic-historical method classifies 
languages into ‘families’ which have a 
common historic origin. A familiar example 
is the descent of the modern forms of 
Romance speech, French, Spanish, Portu- 
guese, Italian, Rumanian, etc., from an 
original unified Latin as the result of changes 
of the same speech form in distinct areas. 
When this diversification has not been so 
remote in time as to remove all evidential 
traces, comparison will reveal characteristic 


* This paper first appeared in the Festschrift 
for Wilson D. Wallis, published by the University 
of Minnesota Press, Method and Perspective in 
Anthropology: Papers in Honor of Wilson D. 
Wallis, edited by Robert F. Spencer (1954). 
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resemblances among languages which have 
such a common origin. What is chiefly 
relevant for such comparisons is the resem- 
blance between individual forms in language, 
both with respect to sound and meaning, 
For example, English nose and German 
Nase have similar sounds and both have 
virtually identical meanings, ‘‘nose”’; English 
hound and German Hund, “dog,” are 
similar in sound and similar though not 
identical in meaning. Any language consists 
of thousands of forms with both sound and 
meaning, the relation between the two 
being arbitrary. In principle, eny sound 
whatever can express any meaning whatever, 
Therefore, if two languages agree in a 
considerable number of such items, as do 
German and English, and if—a problem 
not discussed here—the resemblances can- 
not be explained by borrowing, we necessar- 
ily draw a conclusion of common historic 
origin. Such genetic classifications are not 
arbitrary, in the sense that there is no 
room for the establishment of varied criteria 
leading to different results. This is because 
such classifications reflect historic events, 
which must either have happened or not 
have happened. Either the speakers of 
German and English speak a language 
transmitted from an original unified proto- 
Germanic speech community or they do not. 
The analogy here to biological classification 
is extrernely close as, indeed, the use df 
such terms as family, related, and genet 
might suggest. Just as in biology we classify 
species in the same genus or high unit 
because the resemblances are such as t0 
suggest a hypothesis of common descent, 9 
with genetic hypotheses in language. 

One can, however, compare languages 
which cannot be shown to be related 
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genetically and this either with respect to 
gme features of sound only or meaning 
oly. The two following examples will serve 
at once to illustrate this possibility and to 
show that, in this way, legitimate scientific 
problems arise. All languages must express 
comparison—the fact, for example, that 
me thing is larger than another. If we com- 
pare all the languages of the world in this 
respect, we discover that the number of 
methods is limited—e.g., a special, in- 
fected form of the adjective (English 
greater), the use of a preposition meaning 
from (Semitic), the use of a verb meaning 
surpasses (as widely used in Africa)—and 
that of these some are far more frequent 
than others and have definite geographical 
distributions which disregard genetic bound- 
aries. All these are surely data of real 
interest which require an explanation. The 
phenomenon just discussed is semantic. In 
the area of sound patterns, the independent 
appearance of a system of five vowels with 
two degrees of length—a, a’, e, e', i, i, 0, 
0’, u, u'—in classical Latin, Hausa in West 
Africa, Yokuts, an Indian language of 
California, and doubtless elsewhere—is 
surely worthy of attention. Indeed, such 
phenomena have been studied by 
Trubetskoy and others in an attempt to 
discover what types of vowel and consonant 
systems are possible and their relative 
frequency and area of appearance. 

If genetic comparison establishes classes 
of languages, i. e., language families in the 
accepted sense, does not typological classi- 
fication do this also? The answer, of course, 
is that it can but that, as opposed to genetic 
classification, it has no specific historic 
implications and is arbitrary, i. e., will lead 
to different results depending on the criterion 
or combination of criteria selected. 

In this respect it is like racial classifica- 
tion based on a number of arbitrarily se- 
lected traits. If, for example, we select such 
a purely phonetic criterion as the presence 
or absence of lip rounding as a feature 
distinguishing pairs of vowel phonemes, 
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the languages of the world will fall into 
two groups, those that avail themselves of 
this particular principle of vontrast and 
those that do not. English and Italian, 
which do not, will fall in class A with in- 
numerable other languages; and French 
and German, along with a smaller number 
of other languages from different parts of 
the world, will fall into class B. If some 
other typological trait is selected, say the po- 
sition of the dependent genetive relative to 
the noun, again two classes of languages will 
emerge, but they will not coincide with 
those obtained by the criterion of rounded 
versus unrounded vowel. Taking both 
factors into consideration, there will be 
Certain characteristics will 
allow of more than two classes. If languages 
are classified semantically on the basis of 
their numerical system, we shall obtain a 
number of classes of languages: binary, 
quinary, decimal, duodecimal, and no doubt 
others as well. In short, in contrast to 
genetic classification, the number of lan- 
guage groups and their membership will 
differ with the number and particular 
selection of linguistic traits utilized for the 
comparison. At one extreme, if some such 
feature as the presence of a vowel system 
is employed as the sole criterion, the lan- 
guages of the world will fall into two groups, 
those with (and this group will include all 
the languages of the world) and those 
without. This latter class will, of course, 
have no member. At the other extreme, one 
can specify so many traits that each lan- 
guage becomes the only member of a specific 
type. 

Many such classifications, as should be 
evident from the examples just cited, are 
not very useful. We seek to establish typolo- 
gies which involve characteristics of funda- 
mental importance in language and which 
are useful for a variety of reasons. Such a 
classification does exist, the nineteenth- 
century division of languages which, in its 
classical version, was threefold: isolating, 
agglutinative, and inflective types. 
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This trop fameuse classification, to use 
Meillet’s trenchant phrase, betrays fun- 
damental weaknesses which have inevitably 
led to its present state of disrepute. Yet the 
problem itself 7s of sufficient importance for 
Sapir, in the only major contribution to 
the subject since the nineteenth century, to 
have made it the central topic of his book 
Language. However inadequate the nine- 
teenth-century discussion of this topic now 
appears, the major virtue of the schemes 
advanced remains. Something of funda- 
mental importance to the over-all 
characterization of a language, the morpho- 
logical structure of the word, was instinc- 
tively seized upon as the basis of classifica- 
tion, and Sapir’s contribution simply 
continued this essential feature of the 
earlier approach, in revised form. Other 
typologies, particularly phonological, are 
possible, and these latter have been the 
chief center of interest in typological 
discussions in recent years. However, the 
problem of a morphological typology re- 
mains, as evidenced by Rulon Wells’s 
recent statement, “It is shocking that 
there is no established taxonomy of the 
languages of the world” (1950, p. 31). The 
revived interest in this topic is further 
evidenced by the virtually simultaneous 
and independent development by Charles 
Hockett of Cornell and myself of the ideas 
presented in this paper.! In short, the time 
seems propitious to re-examine the nine- 
teenth-century approach to the problem, 
discarding what the intervening period of 
linguistic criticism has demonstrated as 
invalid and incorporating recent methodo- 
logical advances in order to reformulate the 
hypotheses along more rigorous lines. A 
brief critical review of earlier attempts may 
serve to place the present treatment in its 
proper perspective. 

1The same basic approach was presented by 
Charles Hockett in an-unpublished paper read at 
the annual meeting of the Linguistic Society of 
America, Philadelphia, 1949, and in a talk I gave 


before the Linguistic Circle of New York at 
Columbia University, January, 1950. 
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The germ of all later classifications js 
found in the distinction, first set forth by 
Friedrich von Schlegel in his essay “Ueber 
die Sprache und Weisheit der Inder” 
(1808), between languages with affixes and 
languages with inflections. The valuational 
attitudes so prominent throughout the 
later history of the theory are already 
present in this earliest formulation. The 
affix-languages express relations through 4 
merely mechanical process. In a striking 
figure they are likened to a “heap of atoms 
which every wind of chance scatters o 
sweeps together” (p. 51). Only Indo. 
European languages are inflectional, but 
Semitic is tending in that direction. 

This twofold classification was elaborated 
into a tripartite one by Friedrich’s brother, 
August von Schlegel. In an essay “Sur la 
littérature provengale” (1818), Schlege 
describes three classes of languages: “‘an- 
guages without grammatical structure, 
affixing and inflectional languages” (p. 559). 
Of his first class, called by later writer 
isolating or root languages, he says, “‘One 
might say that all their words are roots, but 
sterile roots which produce neither plants 
nor trees” (p. 159). To create a literature a 
science in such a language is a tour de force, 
The affixing languages use added elements 
(‘‘affixes’’) to express the relations and 
nuances of the root ideas, but these affixes 
still have an independent meaning. Re 
garding inflectional languages in whic 
such affixes are meaningless (i.e., devoid o 
concrete meaning), we learn that to then 
can be attributed a kind of organic life 
(“organisme”’) in that “they contain a vital 
principle of development and _ growth’ 
(p. 159). As with later writers, Chinese is 
his model of a root-language; only the 
Semitic and Indo-European languages be 
long to the inflectional group; all the other 
belong to the vast and _ heterogeneous 
intermediate or agglutinative class. The 
inconvenient fact that the Indo-European 
languages have tended to lose their inflection 
bothers him as it did other later writers. He 
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therefore introduces a further subdivision 
of his inflectional languages, the earlier 
synthetic and later analytic. All analytic 
languages we know result from the decom- 
position of synthetic languages. Passing 
over Other writers who discussed the topic 
in much the same terms as Schlegel, we 
come to Wilhelm von Humboldt who, in his 
essay ‘Ueber die Verschiedenheit der 
Menschlichen Sprachen” (1836), placed 
this type of analysis at the very heart of 
his approach to language. Von Humboldt 
viewed each language as a distinct self- 
revelation of the spirit (Geist). Such self- 
revelations, while each a valid expression 
in its own right, exhibit lesser or greater 
degrees of perfection. There are four classes 
of languages in von Humboldt’s scheme. 
He adds a fourth, incorporating type to 
the by now traditional threefold classifica- 
tion in order to accommodate certain 
American Indian languages whose very 
complex word-patterns include instances in 
which the object of a verb is incorporated 
in the same word as the verb root. Von 
Humboldt is explicit in rejecting any histor- 
ical evolutionary interpretation in which 
higher types evolve out of lower types. 
These are ideal types involving different 
degrees of the unfolding of form. The 
isolating languages are “formless,” the 
incorporating languages, through their over- 
elaboration, betray no true sense of form. 
As might be expected, only the inflected 
languages, by their harmonious fusion of 
root and affix in a true unity, are credited 
with a true sense of form. 

The definitive exposition of the theory is 
found in the writings of A. Schleicher, who 
stood under the twin influences of Darwin 
and Hegel. The classes of languages are now 
interpreted as so many historical evolu- 
tionary phases of the development of 
languages. The types are limited to three, 
which are equated to the stages of the 
Hegelian dialectic. The inflectional class of 
languages becomes a_ higher synthesis 
arising out of a previous opposition. The 
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decay of inflection in historic times intro- 
duces a new phase of the Geist in which the 
material side of language no longer counts. 
In accord with the intellectual currents of 
the age, and equipped with an impressive 
set of quasi-algebraic formulas to indicate 
the various relations of the root to subordi- 
nate elements, Schleicher’s version of the 
theory won wide acceptance and provided 
the basis for the exposition by those two 
great popularizers of linguistic science, Max 
Mueller in Europe and Dwight Whitney in 
the United States. Later versions such as 
Steinthal-Misteli, which added to the 
complications of the scheme without any 
compensating advantages, never became as 


‘popular as Schleicher’s, which thus es- 


tablished itself as the basic form of the 
theory. 

During this whole period criticism of the 
ethnocentrism and vagueness of these 
typologies was by no means lacking. To cite 
but one instance, Whitney, who was by no 
means as enthusiastic about the Schleiche- 
rian typology as his European contemporary, 
Max Mueller, declares ‘loved from love is as 
good a preterite as led from lead or sang 
from sing” (1876, p. 362). Loved is, of 
course, an example of agglutinative tech- 
nique, while led and sing are inflective. Still, 
in regard to isolating languages he talks of 
“this lack of resources possessed by more 
happily developed languages; ... thought 
is but brokenly represented and _ feebly 
aided by its instrument.” Others, particu- 
larly at a later date, were highly critical or 
even contemptuous as, for example, Mauth- 
ner, according to whom “... the valuation 
[of languages] according to whether their 
inflections are more or less transparent is 
as foolish as if one judged the merit of 
European armies according to the greater 
or lesser visibility of their trouser seams” 
(1923, p. 309). 

Sapir’s treatment of the topic in his 
book Language (1921) marks an epoch. He 
firmly rejects both the valuative and 
evolutionary aspects of the theory. There 
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is no real reason to suppose that Chinese or 
Hungarian is not as effective an instrument 
of thought as Latin or English. ‘‘When it 
comes to linguistic form, Plato walks with 
the Macedonian swineherd, Confucius with 
the head-hunting savage of Assam”’ (p. 234). 
Man must have had language for at least 
500,000 years; hence, if there is indeed a 
line of development—isolating, agglutina- 
tive, inflective—present-day isolating lan- 
guages cannot possibly represent the primi- 
tive stage. Indeed, the evidence of human 
paleontology and glacial geology had made 
this assumption long since untenable. 
Chinese, the classic instance of an isolating 
language, was known by Sapir’s time to 
have formerly possessed a more complex 
morphological system, both from earlier 
records of the language and comparison 
with Tibetan and other related languages. 
Perhaps more significant than these no 
longer tenable assumptions were other 
logical defects which had been pointed out 
from time to time. The distinctive criteria 
of the various types had never been either 
defined with clarity or applied with ob- 
jectivity. In reading Steinthal-Misteli, one 
has the feeling that the writer is playing 
with loaded dice. Whenever, on the basis 
of his own showing, a non-Aryan, non- 
Semitic language appears to have a praise- 
worthy feature, a tacit shift in definition 
shows that ‘true form” or “true inflection” 
is not involved. The definitions were not 
only vague but partly referred to quite 
different things, so that a language might 
well belong to several of the supposedly 
mutually exclusive classes simultaneously. 
For example, agglutination was usually 
taken as referring to a technique of mechan- 
ical affixation, so that, in Max Mueller’s 
words, ‘‘The difference between an Aryan 
and Turanian is somewhat the same as 
between good and bad mosaic. The Aryan 
words seem made of one piece, the Turanian 
words clearly show the sutures and fissures 
where the small stones are cemented to- 
gether” (1890, p. 292). The contrary term 
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should be inflection. But the term inflection 
was also used to indicate the presence of 
affixes without concrete meaning to denote 
relations among words of the sentence; e.g,, 
by case endings in the noun or person- 
number terminations in the verb. On this 
basis Turkish is both agglutinational (on 
the basis of technique) and _ inflectional 
(because of its case and verb conjugational 
systems). 

A further defect, as was pointed out by 
some critics, was that a language had to be 
assigned to a single category, although the 
features employed might be present to a 
great or less degree. A term like agglutinative 
applies primarily to a single construction. 
A language may well and indeed usually 
does contain some agglutinational as well 
as some nonagglutinational constructions. 
In other words, it is a matter of over-all 
tendency rather than absolute presence or 
absence of the diagnostic traits. Sapir, by 
distinguishing among the various criteria 
which were unconsciously being employed 
in a confused way in the classical theory, 
constructs a more complex system in which 
languages are classified by a number of 
independent criteria and in which the 
traditional terms are retained, but in well- 
defined uses which are often on different 
axes so that they are no longer mutually 
exclusive. 

One such axis distinguished by Sapir may 
be said to relate to the gross complexity of 
the word, i.e., the degree of complexity 
exhibited on the basis of the number of 
subordinate meaningful elements it con- 
tains. The terms employed here by Sapir 
are analytic, synthetic, and polysynthetic, in 
ascending order of complexity. The theoretic 
extreme of analysis is represented by lan- 
guages in which each word consists of only 
one meaningful unit and thus has no internal 
structure. Languages which actually ap- 


proach this extreme are Chinese, Annamite, | 
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mother use. Languages like English with 
yords of little complexity were included by 
Sapir in the analytic group. However, the 
degree of synthesis is but one criterion and 
arelatively superficial one, since it does not 
tll us in what the complexity of the word 
onsists. A second and quite different 
onsideration refers to technique of con- 
struction. The contrast is, roughly, between 
languages in which the subordinate elements 
are added to the root elements mechanically, 
ie, Without either of the elements being 
modified (the most common meaning of 
agglutination in the classical scheme), and 
those involving a process of fusion by which 
the constituent elements become difficult 
to recognize and separate. To quote Sapir’s 
examples: good + ness in English is aggluti- 
native, dep + th, fusional. After treating 
this topic several times with varying con- 
dusions, Sapir finally sets up a fourfold 
division: a, isolating; b, agglutinative; c, 
fusional; d, symbolic. Isolation, by which 
Sapir means significant order of elements, is 
included here because he thinks of this 
rubric as concerned with techniques of 
relating elements. Just as we know that in 
‘John hit Bill” “John” is in construction 
with “hit’’ as the subject of the verb by the 
fact that it precedes the verb, so in dep-th, 
the modification of deep to dep- indicates 
that it is in a construction with -th. Since 
the agglutination-fusion contrast refers 
merely to the mechanics of expression 
rather than to what is being related, Sapir 
considers this scale likewise somewhat 
superficial, though useful as a supplementary 
criterion. 

The division which appears to Sapir the 
most basic rests on the following consid- 
erations. There are two types of concepts 
which all languages must express, a stock of 
roots with concrete meanings, e.g., table, 
eat, and pure relational ideas which “serve 
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the two extremes of concrete and abstract, 
Sapir puts these two classes at the ends of 
his scale as I (concrete), IV (pure rela- 
tional). Between these, he places two groups 
of concepts which are dispensable, since 
some languages have them and others do 
not. Class II consists of derivational con- 
cepts which “differ from type I in expressing 
ideas irrelevant to the proposition as a 
whole but that give a radical element a 
particular increment of significance and 
that thus are related in a specific way to 
concepts of type I.” Sapir’s example is 
the suffix -er in farmer, which gives an 
increment of meaning to farm- but is not 
concerned with the structure of the rest of 
the sentence. A language which does not 
have concepts of type II would employ a 
single unanalyzable element to convey the 
meaning farmer. Concepts of type III 
(concrete relational concepts) lead over to 
type IV (pure relational), insofar as they 
help to relate members of a sentence to 
each other, but differ in that they have an 
element of the concrete in their meaning. 
Examples are the elements indicating 
gender in a language like German. The -er 
of d-er in “der Bauer tétet das Entelein” 
(the farmer kills the duckling) relates d- 
to Bauer by agreement in number, gender, 
and case. It thus indicates that d- modified 
Bauer and that Bauer is singular and 
subject of the sentence. It does this, how- 
ever, with the material alloy of the indica- 
tion of sex gender, in this case, masculine. 
Such concepts are called by Sapir concrete- 
relational. Since we have four types of 
concepts, of which I and IV are necessary 
in all languages, but II and III are dis- 
pensable, we set up the following four classes 
of languages: A consists of those with I and 
IV only. These Sapir calls Simple Pure- 
Relational Languages, e.g., Chinese. Group 
B contains those with II as well as the 
indispensable I and IV. These are Complex 
(i.e., Deriving) Pure-Relational Languages. 
Group C is constituted by languages which 
have I, III, and IV, but not II. These are 
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Simple Mixed-Relational Languages. In 
group D, finally, we have those languages 
which have all four types of concepts, i.e., 
Complex Mixed-Relational Languages. Of 
the two considerations, the presence or 
absence of II (derivational) or presence 
and absence of III (concrete-relational) 
Sapir considers the latter, which aligns 
types A and B versus C and D, as the more 
fundamental. In his final table of classi- 
fications, Sapir takes a number of languages 
and, employing the sets of criteria already 
discussed, first assigns a particular language 
to one of the “fundamental types” A, B, 
C, and D just mentioned. He likewise 
indicates degree of synthesis. 

The third factor, technique, is specified 
separately for each of the groups of con- 
cepts II, III (when they are present), and 
IV, using the above mentioned scale: a, 
isolating; b, agglutinative; c, fusional; d, 
symbolic. Sapir often mentions two and 
sometimes three techniques. If weakly 
developed, he encloses the symbol in paren- 
theses. He then gives an over-all estimate 
of the dominant technique of the language, 
often using compound terms such as aggluti- 
native-fusional to indicate roughly equal 
use of these two techniques. Note that 
Sapir does not use the term inflectional 
anywhere in his finished classification. He 
defines inflection as the use of fusional 
techniques in the sphere of inflectional 
units. He believes that, so defined, it is not 
of sufficient importance to figure as a funda- 
mental term in his classification. The pres- 
ence of inflection is, therefore, indicated by 
the appearance of c, indicating fusion, or c 
and d, indicating fusion and symbolism, in 
association with the concepts of group III 
(mixed-relational). As an example of Sapir’s 
total scheme, let us take his classification of 
Semitic. These languages are assigned as a 
group to D, i.e., complex-mixed relational 
languages with all four types of concepts 
present. They are synthetic. In the area of 
derivational concepts (II) the techniques 
are listed as d, c, in that order, i.e., symbolic 
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and fusional. In the area of mixed-relational 
concepts (III) the techniques are given a 
c, d, fusional, symbolic. Under IV the 
technique is (a), isolation, i.e., significant 
word order, the parentheses indicating its 
weak development. There is finally an over. 
all estimate of technique as symbolic. 
fusional. 

The present method utilizes the Sapir 
classification in revised form. The basic 
criticisms of Sapir, already voiced by Most 
(1948, pp. 183-190), boil down to two. The 
first and most significant is that in his 
classification into four basic types, Sapir 
seems to be talking about concepts, but in 
reality his test is a formal, not a semantic 
one, a fact which leads to some difficulties 
in exposition. For example, Sapir discuss 
the concept of the plural, which he consider 
highly abstract. However, it can be assigne( 
anywhere along the scale I-IV in a particula 
language, as he points out. His test, then, a 
to whether the plural is a root concept (I),: 
derivational one (II), or relational (III o 
IV) depends upon which formal class a 
particular language assigns it to. Sapir 
recognizes this disparity. “It is because our 
conceptual scheme is a sliding scale rather 
than a philosophical analysis of experienc 
that we cannot say in advance just where to 
put a given concept” (1921, p. 117). In the 
typology presented here, the approach i 
formal. It is recognized that there is, in 
fact, a tendency for meanings of root 
morphemes (Sapir’s I) to be more concrete 
in meaning than derivational (Sapir’s II) 
or inflectional morphemes (III or IV), but 
this is too vague to constitute a valid 
procedure. Here, as elsewhere in modem 
linguistics, we isolate our distinctive units 
by a formal, not a semantic, test for purely 
practical reasons. The second criticism re 
lates to Sapir’s scale: a, isolating; b, aggluti- 
native; c, fusional; d, symbolic. Isolating is 
a technique of relating, as are the other 
devices, but applies almost only between 
words, relative order being rarely significant 
within the word. It is therefore out of place 
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here, and this betrays itself in the asym- 
metry of its occurrence in Sapir’s scheme, 
for it only appears as a technique under IV 
(pure relational concepts) and does not 
refer to relations within the word as with 
the other techniques, but between words. 

The method of classification proposed here 
is fundamentally that of Sapir, with certain 
modifications in the light of these criticisms. 
Moreover in place of intuitive estimates 
based on over-all impressions, an attempt is 
made to define each feature involved in this 
classification in terms of a ratio of two 
units, each defined with sufficient rigor and 
by the calculation of a numerical index 
based on the relative frequency of these 
two units over stretches of text. Five bases 
of classification are set up in place of Sapir’s 
three, and a set of one or more indices are 
established to measure the place of any 
language in regard to each of them. The 
first of these parameters is the degree of 
synthesis or gross complexity of the word. 
Since Sapir’s time the minimum meaningful 
sequence of phonemes in a language has 
come to be called the morpheme in American 
linguistics. For example, the English sing-ing 
contains two morphemes but forms one 
word. The ratio M/W where M equals 
morpheme and W equals word, is a measure 
of this synthesis and may be called the 
synthetic index. Its theoretical lower limit 
is 1.00, since every word must contain at 
least one meaningful unit. There is no 
theoretical upper limit, but in practice 
values over 3.00 are infrequent. Analytic 
languages will give low results on this 
index, synthetic higher, and polysynthetic 
the highest of all. 

The second parameter refers to technique. 
At one extreme are languages in which the 
meaningful elements are joined with slight 
or no modification. This is the classic 
meaning of agglutination. The opposite 
phenomenon is reciprocal modification or 
merging of the elements. Actually a number 
of types of constructions can be distinguished, 
and thus a more subtle typology might be 


QUANTITATIVE APPROACH TO MORPHOLOGICAL TYPOLOGY OF LANGUAGE 


185 


developed. For purposes of the present 
paper an alternative is selected which seems 
to correspond most closely to the intent of 
Sapir and of the usual nineteenth-century 
analyses. To use contemporary terminology, 
what is involved is the degree of morpho- 
phonemic alternation. The meaningful 
stretch actually found in an utterance is 
called a “morph.” A number of related 
morphs are subsumed under a single funda- 
mental unit, the morpheme. The various 
morphs are then said to be in alternation. 
For example, in English we relate the 
morph lijf (“leaf”) with the morph lijv- 
which only occurs with the plural morph -z 
to form lijvz (“leaves”). Lijf and lijv- are 


morphs which alternate within the same 


morphemic unit. The rules for the statement 
of this alternation belong to the morpho- 
phonemic section of the description of 
English. Where there are no variations 
among the constituent morphs of a mor- 
pheme or where the variations which exist 
are all automatic, the morpheme itself is 
said to be automatic. By an automatic 
alternation is meant one in which all the 
alternants can be derived from a base form 
by a set of rules of combination that holds 
in all similar instances throughout the 
language. This matter will be discussed in 
greater detail below. If both morphs in a 
construction belong to morphemes which 
are automatic, the construction is called 
agglutinative. 

The index of agglutination is the ratio of 
agglutinative constructions to morph junc- 
ture. There is necessarily always one less 
morph juncture in a word than the number 
of morphs. Thus leaves has two morphs but 
one morph juncture. The index of aggluti- 
nation is A/J, where A equals the number 
of agglutinative constructions and J equals 
the number of morpheme junctures. A 
language with a high value for this index will 
be agglutinational, one with a low value 
fusional. In general, the lower the first, or 
synthetic, index, the fewer the morph 
junctures which occur and the less the 
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importance of this second index in charac- 
terizing the language. If a language reaches 
the theoretic lower limit of the synthetic 
index 1.00, this second index becomes im- 
possible of calculation because no morpheme 
junctures occur; that is, the agglutinational 
index becomes 0/0, which is meaningless. 
In calculating this index the differences 
between the degree of agglutination which 
might be found in constructions involving 
Sapir’s concepts of groups II, III, and IV 
and which, as we have seen, figures in 
Sapir’s final formulation, have been ignored. 
Such indices might be calculated on the 
basis of the distinctions between root, 
derivational, and inflectional morpheme 
classes described below, since it is these 
features which correspond most closely to 
Sapir’s division of concepts. They were not 
set up, partly in order to avoid too great 
over-all complications in the typology, 
partly because of the considerable labor 
involved in their calculation. 

The third parameter corresponds most 
closely to what for Sapir was the most 
fundamental basis of classification in lan- 
guages, the presence or absence of deriva- 
tional and _ concrete-relational concepts. 
Since, as has been seen, an approach in terms 
of the meaning of concepts is too vague to 
handle with any degree of rigor, the present 
treatment is based on the possibility of the 
exhaustive division of morphemes into three 
classes, root, derivational, and inflectional. 
Every word must have at least one root 
morpheme and many words in many 
languages have no more. The existence of 
more than one root morpheme in a word is 
called compounding. This is a significant 
feature in which languages differ from each 
other considerably. There are languages 
which cannot compound at all, or only very 
sparingly. Others again compound freely, 
while most languages are somewhere be- 
tween. It is remarkable that Sapir nowhere 
seems to take this into account in his typol- 
ogy. It can easily be measured by a com- 
positional index R/W, where R_ equals 
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number of root morphemes and W equals 
number of words. The second class of 
morphemes is the derivational. Examples 
of derivational morphemes in English are 
re- in re-make, -ess in lion-ess, -er in lead-er, 
The derivational index is D/W, the ratio 
of derivational morphemes to words. Lan- 
guages with a high D/W will belong to 
Sapir’s Complex or Deriving subtypes and 
will thus fit into classes B or D in his 
scheme. The inflectional morphemes are the 
third class. Examples in English are -s of 
eats and the -es of houses. 

The inflectional index is I/W, i.e., the 
proportion of inflectional morphemes to 
words. This does not quite cover Sapir’s 
concepts of type III (concrete relational), 
as will be shown. However, a language which 
has these concepts and so belongs to Sapir’s 
mixed-relational types C and D necessarily 
has a fairly high value for the inflectional 
index; the inverse relation does not neces- 
sarily hold. 

The fourth parameter refers to a topic 
discussed by Sapir as important for the 
morphological structure of a language but 
not included in his final formulation. This 
is the order of subordinate elements in 
relation to the root. The main distinction 
here is between the use of prefixes and 
suffixes. The prefixial index P/W, is the 
ratio of prefixes to the number of words, 
and the suffixial, S/W, is the ratio of suffixes 
to the number of words. An index of in- 
fixing, that is, of the number of subordinate 
elements which are incorporated within the 
root could likewise have been calculated, 
but the occurrence of infixes was so rare in 
the particular languages investigated that 
it seemed justifiable to omit it. There are 
an indefinite number of further types of 
relative position of subordinate elements to 
the root, for example, containment, as in 
the Arabic imperfective prefix of the second 
person feminine which surrounds the verb 
morpheme, e.g. tagtuli:, “thou (fem) 
killest,”’ in which the second person feminine 
morpheme is ta----i- while “kill” is -q-t-l and 
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“imperfect tense” is -u-. There is likewise 
intercalation, again found in Semitic, in 
which a portion of the subordinate element 
precedes or follows the root and another 
portion is incorporated. All these devices 
are so rare that, at least for the languages 
treated, it did not seem worth while to 
calculate indices. Sapir’s symbolism, which 
he considers a kind of technique along with 
isolation, agglutination, and fusion, really 
belongs here. Sapir’s symbolism or internal 
change is, in my opinion, simply the infixing 
of an inflective element, e.g., the preterite 
-a- in sang. When the elements are deriva- 
tional, as in Indonesian languages, the 
process is usually called infixing. This brings 
out the fact that there are two distinct 
considerations involved in Sapir’s use of 
the term symbolism, position and regularity. 
An infixing process might well be regular, 
in which case the construction would be 
agglutinative. In fact, though, this hardly 
ever occurs. 

The final parameter has to do with the 
devices employed for relating words to each 
other. It therefore brings in syntactical as 
well as morphological considerations. There 
are three devices that languages may use, 
inflectional morphemes without concord, 
significant order, or concord (agreement). 

Languages using the first two devices 
belong to Sapir’s pure-relational category, 
while those using concord are mixed-rela- 
tional. The inflectional index discussed 
above would include both nonconcordial 
and concordial inflectional morphemes. This 
index, which may be called the index of 
gross inflection, is therefore of limited use 
for the present problem. It might seem that 
by distinguishing between concordial and 
nonconcordial inflectional morphemes and 
by assigning the words without inflectional 
morphemes to the isolating class, it would 
be possible to make a clear threefold divi- 
sion. The extent of isolating, inflectional, 
and concordial techniques could then be 
calculated by three indices based on the 
tatio of each of these types to the total 
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number of words. There are a number of 
complications that prevent such a simple 
procedure. Many languages, for example, 
Latin, merge concordial and nonconcordial 
features in the same inflectional morpheme. 
Thus, the -um of the Latin masculine 
accusative singular of adjectives has two 
concordial features, gender and number, 
and a pure inflectional one, case. In such 
cases, the procedure adopted is to count 
the same morpheme a number of times, 
one for each distinct feature. Another dif- 
ficulty arises in regard to order. Order 
probably always has some value in relating 
elements even where inflection exists. We 
relate the accusative to the nearest verb 
even where the order is not fixed. Order 


‘may even be fixed although other means 


are present to indicate which words are in 
construction. This is largely true of German, 
for example. Significant order should perhaps 
be restricted to cases where a change of 
order produces a change of constructional 
meaning. The criterion employed was 
closest to this latter one, but easier of 
application. The absence of an inflectional 
morpheme in a word was taken as an 
indication that the method of relating it 
was order. If we call each instance of the 
use of a principle to indicate relations 
between words in the sentence a nexus, 
then three indices, O/N, Pi/N, and Co/N 
were calculated, where O = order, Pi = 
pure inflection, Co = concord, and N = 
nexus. 

The following, in summary, are the 
typological indices which have been de- 
scribed: 


(1) M/W = index of synthesis 
(2) A/J = index of agglutination 
(3) R/W = compounding index 
(4) D/W = derivational index 

(5) I/W = gross inflectional index 
(6) P/W = prefixial index 

(7) S/W = suffixial index 

(8) O/N = isolational index 

(9) Pi/N = pure inflectional index 


(10) Co/N = concordial index 
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The validity of these indices assumes 
that we can define the units employed 
consistently and in such a manner that 
they may be applied to all languages. In 
fact, there is hardly one of the units em- 
ployed in the above formulas which does 
not admit of a number of alternative 
definitions. The choices made here are 
dependent on the particular purposes of the 
study. We always ask what it is that we 
want to measure. In certain cases there 
seems to be no good reason for the choice of 
one alternative over the other from this 
point of view, and a purely arbitrary choice 
was made since some basis of decision had 
to be reached. It may be of some comfort 
to note that the theoretically wide range of 
choice of definitions for certain units only 
bore on decisions for a relatively small 
proportion of difficult instances. As evidence 
of this, the results of the indices calculated 
for a passage of 100 words of English in 
1951, and arrived at by methods no longer 
fully recoverable by introspection, may be 
compared with indices for a 100-word 
passage done recently in accordance with 
the methods outlined here. 


1951 1953 
i i kg ures 1.62 1.68 
Agglutination..........  .31 30 
Compounding.......... 1.03 1.00 
ere 1.04 
i ie us wih nid are 50 64 
Gross inflection......... 64 59 


It should be emphasized that other 
alternatives than those chosen here for the 
definition of units are equally possible and 
probably preferable for certain other pur- 
poses, for example, writing a grammar of a 
language. 

In the following section the chief problems 
encountered in defining the units employed 
in the indices are discussed. These refer to 
the morph, the morpheme, agglutinative 
constructions, the distinction of root, 
derivational and inflexional morphemes, and 
the word. Nothing approaching an exhaus- 
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tive treatment of the problems is attempted 
here. The purpose of the present discussion 
is merely to point out the chief problems 
encountered in this study and the reasons 
for the particular solutions which were 
adopted. 

Basic to the synthetic index as well as 
most of the others is the possibility of 
segmenting any utterance in a language 
into a definite number of meaningful 
sequences which cannot be subject to further 
division. Such a unit is called a morph, 
There are clearly divisions which are 
completely justified and which every analyst 
would make. For example, everyone would 
divide English eating into eat-ing and say 
that there were two units. There are other 
divisions which are just as clearly unjustified, 
For example, the analysis of chair into ch, 
“wooden object,” and -air, “something to 
sit on,’ would be universally rejected. 
There is, however, an intermediate area of 
uncertainty in which opinions differ. Should, 
for example, English deceive be analyzed 
into de- and -ceive? It is this intermediate 
area with which we must be able to deal. 
We start with a set of forms that will be 
hereafter called a square. A square exists 
when there are four meaningful sequences 
in a language which take the form AC, 
BC, AD, BD. An example is the English 
eating : sleeping :: eats:sleeps, where A is 
eat-, B is sleep-, C is -ing and D is -s.? Where 
a square exists with corresponding variation 
of meaning, we are justified in segmenting 
each of the sequences of which it is com- 
posed. Once it has been segmented, each of 
its segments may then be tested to discover 
if it also is a member of a square. If it is, it in 
turn will be segmented into two morphs. 
If it is impossible, then we have reached the 
limit of analysis and cannot divide further. 
A test of correspondence of meaning is 


2QOne of the four elements may be zero pro- 
vided the sequences in which it occurs are free 
forms, i.e., may occur in isolation. For example, 
hand:hands :: table:tables is a valid square, in 
which A is hand, B is table, C is zero, and D is —8. 
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applied to avoid such squares as hammer: 
ham :: badger:badge. A square conforming 
to these conditions described will always 
give us valid, generally acceptable analyses. 
It is too severe, however, in that it excludes 
some segmentations which everyone would 
want to accept. The first extension which we 
make is the following. A sequence which 
occurs with a member of a square is also 
recognized elsewhere if with regard to this 
member (a) the sequence of phonemes is 
identical except for automatic changes (for 
which see below) and (b) if the meaning is 
the same. On this basis we recognize a 
segmentation of huckleberry into huckle 
plus berry, since berry itself is a morph 
elsewhere. This leaves huckle- 
morph although it never occurs in a square. 
If now huckle- were to occur in some other 
combination, we would recognize a segmen- 
tation there also and so add a new morph. 
This process is continued until we reach a 
sequence that does not recur in any other 
combination. In this instance we have 
reached it with huckle-. 

A further extension must be made in the 
case of what might be called a formally 
defective square. We should like to analyze 
men into two morphs, one with the meaning 
“man” and the other “plural,” but there is 
no square into which it can be put. For 
example, man:men :: boy:boys is formally 
defective. We set up the following rule. If 
there can be found a square like the one 
just cited in which boy: boys itself is a pair in 
another valid square, e.g., boy:boys :: lad: 
lads, and if man may always substitute for 
boy and men for boys and produce a gram- 
matical (even though at times semantically 
improbable) sentence, then man:men may 
be subjected to a segmentation analo- 
gous to that of boy:boys, and men may be 
considered two morphs. In the case of sheep: 
sheep :: goat: goats we recognize two morphs 
for the plural of sheep, one of which is a 
zero. Such analyses are not to be confused 
with into two or more 
semantic categories, where no valid square 


segmentations 
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exists for substitution. In Latin, for example, 
we cannot analyze -us nominative singular 
into two morphemes, nominative and 
singular. The square -uws:6: :7:is—nomina- 
tive singular:dative singular: : nominative 
plural:dative plural—does not have a 
pair which can be substituted for members 
of a formally perfect square, and hence the 
segmentation of these forms is not permitted. 
Corresponding to formally defective squares 
we have those which are semantically 
defective. Here, if there are parallel non- 
automatic variations, the analysis is per- 
mitted even though definite meanings 
cannot be assigned to the morphs. Thus, 
the sets deceive:receive: :decep-tion :recep- 
tion: :decei-t:recei(p)t justify the segmenta- 
tions de + ceive and re + ceive. This rule 
permits the usual assumption of morphs for 
derived forms of the verb in Semitic. In 
view of the variety of meanings in instances 
of this kind, it would be difficult to operate 
without it. 

There are certain extensions of the morph 
concept which are rejected here as incon- 
sistent with the purposes of this study, 
although entirely legitimate for other 
purposes. No discontinuous morphs which 
contain segments in two different words are 
accepted. This is understandable, since we 
wish to measure the ratio of morphemes to 
words and we therefore want each word 
to contain a definite number of morphemes 
restricted to the word itself. We likewise do 
not include meaningful units simultaneous 
with grammatical stretches longer than the 
word, e.g., intonation patterns of the sen- 
tence. The reason again is clear. We wish 
morphemes to be parts of words and this 
they cannot be if they are simultaneous 
with an entire sequence of words. In this 
connection it should be noted that neither 
the index of synthesis nor any of the others 
utilized in the present study is a general 
measure of total complexity of a language. 
Intonational patterns and certain other 
items which contribute to the over-all 
complication of a language are not included. 
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The next step to be taken after the identi- 
fication of morphs is the establishment of 
more complex units, morphemes, with 
morphs as their members. It is this aspect 
of the problem which has been most fre- 
quently discussed in the literature as the 
chief subject-matter of morphemic analysis 
by Harris, Hockett, Bloch, Nida, and 
others. In general, the principles set forth 
by Nida are adequate (1948, pp. 414441). 
These involve the generally recognized 
criteria of sameness of meaning (here 
applied strictly) and of complementary 
distribution and the requirement that if 
morphs of varying phonemic shape are to 
be assigned to the same morphemic unit, 
there must be at least one nonvarying unit 
with at least as wide a distribution. 

On this point, however, for reasons to be 
explained, it is inadvisable to accept the 
complementations allowed by Nida in 
accordance with his rule that ‘“comple- 
mentary distribution in tactically different 
environments constitutes a basis for com- 
bining different forms into one morpheme 
only on the following condition: that some 
other morpheme—belonging to the same 
distribution class, and having either a 
single phonemic shape or phonologically 
defined alternant shapes—occurs in all the 
tactically different environments where 
the forms in question are found” (p. 421). 
For example, in Arabic there are pronominal 
suffixes indicating possession when added to 
the noun and another set indicating the 
verb object when added to the verb. These 
are tactically different: i.e., the verb in 
general could not be substituted for the 
noun or vice versa. The existence of -ka, 
“second person singular masculine,” and 
other phonemically identical forms in both 
series would, on the basis of Nida’s rule, 
allow us to unite the morphs of the first 
person singular: -7 and -ja, noun possessive, 
and -ni, verb object, as constituent morphs 
of the same morpheme. This alternation is, 
of course, irregular and in reckoning our 
agglutinative index, if we accepted this 
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alternative, we should have to call any 
construction involving one of the first 
person singular suffix forms irregular, or 
nonagglutinative. We should thus be penal- 
izing Arabic, as it were, for having some 
degree of regularity in these forms. In a 
language with two totally different sets of 
pronouns in these uses, Nida’s rule would 
not allow complementation; hence there 
would be no irregular alternations from this 
source, although, on a common-sense 
basis, we should call this the more irregular 
situation. Hence only members of the same 
structural set, i.e., those substitutable for 
each other in the same tactical environment, 
are considered as possible alternants of the 
same morpheme. 
Given certain morphs as alternants of the 
same basic morpheme units, we can define 
an agglutinative construction. It seems 
constant with the traditional use of the 
term to consider its basic reference to be to 
morphological regularity. However, the 
term regular has been used in a number of 
different ways. In the discussions of Bloom- 
field, Wells, and others it is now usual to 
distinguish types and degrees of regularity 
and irregularity. The definition of regularity 
adopted here is that which appears to be 
closest to actual usage in typological 
discussion. This is the requirement that all 
the varying phonemic shapes of the morph 
be derivable from a nonfictive (i.e., actually 
occurring) base form by rules of combination 
which hold for all similar combinations 
throughout the language. This is usually 
called automatic alternation. The case of a 
morpheme which does not have alternating 
morphs, i.e., which involves the same 
phonemic sequence in all occurrences, is a 
limiting instance which is, of course, also 
reckoned as automatic. Sometimes one 
choice of a base form gives automaticity, 
i.e., predictability from base forms, while 
another does not. In cases of doubt those 
base forms are chosen which give the greatest 
degree of automaticity for the total descrip- 
tion of the language. This is admittedly not 
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avery precise rule, but in practice this does 
not prove to be a major difficulty. 

We define automaticity as the property 
of the entire morpheme where every morph 
isin automatic alternation with every other. 
The morphs may often be grouped in 
subalternating sets. It is not sufficient for 
these to be in automatic alternation. The 
English plural morpheme has the statisti- 
cally most frequent subset -s -z -ez, which 
are in automatic alternation. However, 
there are other alternants, e.g., -en, -zero, 
ete., which are not in automatic alternation 
with -s -z -ez as a whole. Hence the English 
plural morpheme is not automatic. 

The possibility of calculating compound- 
ing, derivational, and inflectional indices 
depends on our ability to distinguish root, 
derivational, and inflectional morphemes. 
Of these the root class is probably the most 
difficult of formal definition, but the easiest 
of recognition. By this is meant that in 
actual practice, there is virtually complete 
agreement as to which morphemes are to 
be regarded as root morphemes. The root 
position in the word is characterized by a 
large and easily extendible membership and 
concreteness of meanings. The contrast 
with inflectional morphemes is greatest in 
these respects, since these latter tend to be 
few in number and relational as well as 
abstract in meaning. Another characteristic 
would generally be agreed on. Every word 
must have at least one root morpheme. 
Hence in a one-morpheme word, that 
morpheme is necessarily a root. In contrast, 
derivational and inflectional morphemes 
need not occur, and there are some lan- 
guages, so-called root or isolating languages, 
in which derivational and_ inflectional 
morphemes are rare or perhaps do not 
occur at all. Derivational morphemes may 
be defined as morphemes which, when in 
construction with a root morpheme, estab- 
lish a sequence which may always be 
substituted for some particular class of 
single morpheme in all instances without 
producing a change in the construction. If 
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the class of single morphemes for which the 
derivational sequence may substitute con- 
tains one of the morphemes in the deriva- 
tional sequence itself, we call the sequence 
endocentric; if not, then it is exocentric. 

For example, duckling in English is a 
derivational sequence, since it may be 
substituted anywhere for goose, turkey, etc. 
without change of constructional meaning. 
Since duck is included in this class of single 
morphemes for which duckling may sub- 
stitute, -ling is here an endocentric deriva- 
tional morpheme. Singer is an exocentric 
sequence, since the class of single morpheme 
sequences for which singer may substitute 
consists of single-morpheme nouns only, 


and does not include the verb sing. Hence 


-er is an exocentric derivational morpheme. 

We can now define the inflectional 
morpheme simply as a nonroot, nonderiva- 
tional morpheme making the three classes 
exhaustive and mutually exclusive. Inflec- 
tional morphemes, like derivational, need 
not occur at all in any particular language. 
When it is part of a word pattern, however, 
its appearance in the appropriate position 
is compulsory like that of the root. One 
member of the class is frequently zero. In 
these instances, the absence of an overt 
phonemic sequence shows itself as significant 
because the word in this form has definite 
syntactic limitations on its uses; e.g., the 
nominative singular in Turkish or the noun 
singular in English. 

We finally reach what is in some ways 
the most difficult problem, the definition of 
the word unit. It is clearly fundamental to 
the purpose of the present study inasmuch 
as all the indices involve the number of 
words. In most instances this is explicit; 
sometimes it is tacit, as in the index of 
agglutination, in which the number of 
morpheme junctures is always one less than 
the number of words. There is at present 
no general agreement on this topic. Some 
deny the validity of the word as a linguistic 
unit. Others admit it, but deny that it need 
be taken into account in the description of a 
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particular language. Some say the word is 
definable only for each language in a separate 
ad hoc fashion. Some define it in phonologi- 
cal, others in morphological terms. In 
practice, however, the word continues to be 
the key unit of most actual language 
descriptions. Of the two basic types of over- 
all definitions of the word, the phonological 
and the morphological, the former is clearly 
insufficient for the purposes of the present 
study. In phonological definitions, we 
define the word in terms of some single 
phonological characteristic, or through a 
combination of characteristics which serve 
as markers. These markers are usually 
stresses or boundary modifications of 
phonemes; i.e., junctures. Besides the fact 
that the use of phonological markers to 
define the word sometimes leads to the 
isolation of individual units which we 
should not wish to call words on other 
grounds, many languages do not have such 
phenomena, so that a phonological procedure 
cannot lead to a universal definition. The 
other basis has been called morphological, 
since it is based on the distribution of 
meaningful elements. Of definitions of this 
kind, Bloomfield’s characterization of the 
word as the minimum free form is the most 
satisfactory in that it is universally applica- 
ble and points in the right direction to 
freedom or absence of freedom, that is, 
bondage as the basic criterion. The actual 
test of freedom, ability to occur in isolation, 
is, however, difficult to apply in practice 
and leads to unusual results. For example, 
the in English would not be a word by 
Bloomfield’s test. 

The procedure adopted here can only be 
briefly outlined. It has led to results satis- 
factory for this study in the relatively few 
doubtful cases regarding the existence or 
nonexistence of a word boundary in the 
languages under consideration. Instead of 
asking whether a particular minimal form 
is bound or free in general, as is usually 
done, the present treatment is in terms of 
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morphs in particular contexts. This allows 
us, for example, in Latin to make ab, “from,” 
a free form as a preposition but a bound 
form as a verb prefix in abduco, “I lead 
away.”’ What is specified as bound or free 
is not a morph as such but a contextually 
determined class of mutually substitutable 
morphs. Such a class is here called a morph 
substitution class (MSC). This notion is 
expanded to include a sequence of mor 
pheme substitution classes which may in 
all circumstances be substituted for a 
particular MSC and none of whose member 
are identical in membership with it.* It is 
convenient to use the term nucleus to cover 
both individual MSC’s and such substitut- 
able sequences. Having broken up an 
utterance into nuclei in this manner, we 
now test each nucleus boundary to see if it 
is a word boundary or not. A nuclew 
boundary is a word boundary if it is possible 
to insert an indefinitely long sequence of 
nuclei. If it is an intraword boundary, 
either no nucleus can be inserted or a, fixed 
maximum number can be. For example, in 
the sentence ‘“‘the farmer killed the ugly 
duckling” there are nine morphemes: 
(1) the (2) farm (3) er (4) kill (5) ed (6) the 
(7) ugly (8) duck (9) ling; seven nuclei: 
(1) the (2) farmer (3) kill (4) ed (5) the 
(6) ugly (7) duckling; and six words: (1) the 
(2) farmer (3) killed (4) the (5) ugly (6) duck- 
ling. There is an intraword boundary at 
“‘kill-ed” because no nucleus may be inserted. 


3’ This is necessary in order to exclude endo- 
centric phrases in which a sequence of words can 
always be substituted for the head or chief num- 
ber. A sequence of adjectives followed by a single- 
morph noun would be a nucleus were it not for 
the proviso of dependence among its members. 
Adjectives are not bound to nouns in English, for 
example, because they occur in predicative adjec- 
tive constructions also. For the basic ideas of the 
MSC and the derivational sequence, I am largely 
indebted to the stimulus of the writings of Zellig 
S. Harris and Rulon S. Wells. The resemblance 
to the notions of focus class and expansion of the 
latter writer is particularly close. See especially 
R. Wells, 1947, pp. 81-117. 
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TABLE 1 

Sanskrit |Anglo-Saxon| Persian English Yakut Swahili Annamite | Eskimo 
INNES Scns on wns 2.59 2.12 1.52 1.68 2.17 2.55 1.06 3.72 
Agglutination......... .09 oe 34 .30 51 67 ne -03 
Compounding.........} 1.13 1.00 1.03 1.00 1.02 1.00 1.07 1.00 
Perivation............ .62 .20 10 15 .30 .07 .00 1.25 
Gross inflection....... 84 .90 .39 53 .82 .80 .00 1.75 
a ee .16 .06 01 04 .00 1.16 .00 .00 
ee 1.18 1.03 49 .64 1.15 41 .00 2.72 
OS .16 15 .52 75 .29 .40 1.00 .02 
Pure inflection. ....... 46 47 29 14 .59 19 .00 46 
mempord sb 2... bec le | .38 .38 19 11 12 41 .00 .38 


























On the other hand at the boundary between 
“farmer” and “killed” there is no fixed 
maximum number of insertable nuclei. We 
may talk of the “farmer who killed the man 
who killed the man who... killed the ugly 
duckling.” The contradiction with the 
phonological word is in certain cases merely 
apparent. Thus in Latin the enclitic -que, 
“and,” which is reckoned as a syllable with 
any preceding sequence in locating the 
stress which serves as a phonological word 
marker, is also part of the word by the 
present test. Dd’minus, “the lord,’ and 
do'minis in dé’miniisque, ‘and the lord,” 
are not members of the same MSC because 
they are not substitutable for each other. 
Dé’minis- belongs to the same nucleus as 
legatuis-, puér, and this class is dependent on 
the class of the following -que, -ve since it 
must be followed by it and hence belongs to 
the same word. Even with monosyllables, 
where there is no stress shift, ma’s, ‘“mouse,”’ 
and the mis of mii’s-que, ‘‘and the mouse,” 
are members of different nuclei, since the 
former can be substituted only by dé’minus, 
puér etc. The latter only by the class of 
dé’minis-, puér-. 

Table 1 shows the calculated indices. 
The languages selected are chiefly those 
frequently cited as examples of specific 
types in the existing literature in typology, 
subject to the limitations of my own knowl- 
edge of specific languages. Instead of 
Turkish, the related Yakut was selected as 


an example of an agglutinating language, 
since the extensive Arabic borrowings in 
Osmanli Turkish have led to irregularities in 
the vowel harmony and in other respects to 
such an extent as to render it untypical. 
Two ancient Indo-European languages, 
Anglo-Saxon and Sanskrit, were chosen, and 
two modern languages of the same German 
and Indo-Iranian branches, modern English 
and Persian, were also selected to illustrate 
long-term change in type. Annamite was 
selected as a representative root isolating 
language, Eskimo as polysynthetic, and 
Swahili as an agglutinative, concordial 
Bantu language. (See accompanying table.) 

On the basis of counts such as these, the 
next step would be to define terms like 
analytic, synthetic, agglutinative, and pre- 

4The passages selected were 100 words long 
for each language, as follows: Sanskrit, Hitopadésa, 
ed. Max Mueller, p. 5, varam ekas, ff.; Anglo- 
Saxon, An Anglo-Sazxan Reader, J. W. Bright (New 
York, 1917), p. 5, hit gelamp gio, ff.; Persian, Pizzi, 
I., Chrestomathie Persane (Turin, 1889), p. 107, 
ruzt Ibrahim, ff.; English, New Yorker, Decem- 
ber 13, 1952, p. 29, Anyone who, ff.; Yakut, Ueber 
die Sprache der Yakuten (St. Petersburg, 1851), 
p. 29, min bdéydsé ff.; Swahili, C. Sacleux Gram- 
maire Swahilie (Paris, 1909), p. 321, Kiyana mmoja, 
ff.; Annamite, M. B. Emeneau, Studies in Viet- 
namese (Annamese) Grammar (Berkeley and Los 
Angeles, 1951), p. 226, mot hom, etc.; Eskimo, W. 
Thalbitzer in Handbook of American Indian Lan- 
guages, Part I, ed. Franz Boas (Washington, 1911), 
p. 1066, kaasasurujuynuag, ff (phonemicized and 
slightly normalized to conform to Kleinschmidt’s 
grammatical description). 
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fixing after plotting a frequency distribution 
curve. There are too few languages here to 
make this procedure feasible. However, 
even cursory inspection of the indices set 
forth here shows that, if we define an 
analytic language as one with a synthetic 
index of 1.00-1.99, synthetic as 2.00-2.99, 
and polysynthetic as 3.00 +, the results 
would conform to the usual nonquantitative 
judgments. Similarly we might call a 
language agglutinative if its agglutination 
index is over 0.50 and similarly for the 
others. 

The present results should also be con- 
firmed by further counts, since they were 
obtained only for single passages of 100 
words each, besides which the probable 
error should be calculated for each. One 
might well suspect differences based on the 
style of the passages selected. The synthetic 
index was calculated in English and German 
for a number of passages of varying style, 
however, with remarkably concordant re- 
sults. 

English 


Ladies’ Home Journal, January 
ES eh rears 1.62 
R. Linton, Study of Man, p. 271.. 1.65 
O. J. Kaplan, Mental Disorders in 
| OE Oe ee 
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German 


Baumann, Nama Folk-tale....... 1.90 
Ratsel, Anthropogeographie, p. 447. 1.92 
Cassirer, Philosophie der Symboli- 

achen Forman; Qe 8... o 5. sii 


This, of course, is no substitute for 
statistical evaluation. Another topic that 
can be studied by the present method is the 
general direction of historical changes in 
language over an extended period. The 
coincidences between Sanskrit and Anglo- 
Saxon on the one hand, and Persian and 
English on the other hand, are striking. The 
direction of change for virtually every 
index is the same from the older to the 
more recent language. Were more conserva- 
tive Indo-European languages such as the 
Slavonic chosen, the results might well be 
different. 

The present study is to be evaluated as 
purely a preliminary attempt. Some indices 
may well have to be eliminated and others 
substituted. The specific definitional choices 
may likewise be subject to alteration in 
future studies. However, the general method 
of the calculation of indices based on text 
ratios of carefully defined linguistic ele- 
ments has, I believe, definite value for 
typological studies. 
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), Greenberg’s use of objective measures 
derived from morphology (for the most part) 
as a device for typological classification and 
John Carroll’s use of similar measures in a 
stylistic study of English! suggest that some 
grt of useful numerical indices might be 
obtained from syntactical characteristics of 
languages. 

Some of the most obvious and frequently 
mentioned syntactical differences do not 
easily lend themselves to this technique; for 
instance, the presence or absence of some- 
thing which can reasonably be called a ‘case 
system’, or ‘tense’, or ‘aspect’, the use of 
structural signs for the subject of intransitive 
verbs similar to those for either the object or 
else the subject of transitive verbs, the use 
of the orders SOV, SVO, VSO, VOS, OSV, or 
OVS as necessary or preferred in transitive 
sentences, the presence or absence of 
an inclusive-exclusive contrast in person 
markers, of a dual number (or of number as a 
category at all), of suppletion in the verb 
system—all these, and many more like them, 
are difficult to reduce to a meaningful 
number. 


1. But there are others. Greenberg him- 
self uses one set (his battery of three based 
on the relative use of three methods of 
marking a nexus,—what he calls ‘Isola- 
tion’—apparently including word-order and 


1John B. Carroll, “A Factor Analysis of 
Literary Style’ in Style in Language, Part IX, 
ed. T. A. Sebeok (New York and London: The 
Technology Press of M. I. T., and John Wiley 
and Sons, Inc., 1960). 
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particles as markers—‘Pure inflection’ and 
‘Concord’—i.e. inflection which also serves 
some other function) which is largely or 
wholly syntactical, and some of his others 
suggest easy extensions to syntax. His meas- 
ures of synthesis (ratio of morphs to words) 
and compounding (stems to words) suggest 
a similar measure, either of words per 
sentence (used by Carroll as a stylistic 
measure, and probably unreliable as a typo- 
logical one) or of words per phrase (assuming 
we can get a workable and comparable 
criterion for identifying phrases). His ‘agglu- 
tination’ index could be matched by a 
‘sentence sandhi’ index; but again (a) stylis- 
tic variability would make it unreliable, and 
(b) the information required is often difficult 
to get. His derivation index (derivative 
affixes per word), gross inflection (inflec- 
tional affixes per word), prefixing (prefixes 
per word) and suffixing (suffixes per word) 
suggest similar ratios involving particles 
(again assuming a consistent and compara- 
ble definition): ratio of particles to words 
(this seems less liable to stylistic variation 
than particles to phrases or sentences), of 
prepositive particles (proclitics, prepositions, 
many conjunctions, etc.) to words, and of 
postpositive particles (enclitics, postposi- 
tions, emphatic particles, sentence-final par- 
ticles, etc.) to words. I have worked these 
figures out for a few simple 100-word pas- 
sages of English and of classical Greek.? 


Eng. I II III Plato Thuc. Xen 

Pre/w oF .3P*.. See 220 16 
Post/w OL .O8e- OO ct7 > :04.i1 
OMAN Suds kvacss .02 .382 .38 .o2 .28 .26 





2 In English, one passage in Greenberg’s article 
beginning on page 195, one from TIME magazine, 
April 4, 1960, page 20 (beginning with ‘‘Amidst’’) 
and another from page 55 (‘‘The acid bath’’); in 
Greek, one from Plato’s Apology (beginning at 
21A), one from Thucydides (2.7. 1-2), and one 
from Xenophon (Anabasis, beginning at the 
beginning). 
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These ratios seem to be fairly stable, 
though there is plainly some stylistic varia- 
tion. In regard to the total particle ratio, 
English and Greek do not differ greatly 
(though other languages—e.g. Turkish— 
would), but there is a striking difference in 
the proportion of postposed vs. preposed 
particles, where Greek shows many more 
postposed particles than English (in one 
sample, Plato, these are even more numerous 
than the preposed ones). The ratio of words 
to sentences, mentioned above, shows the 
following values for the same 6 samples. 


EI 
16.7 


E II 
20 


EIII Pl. Th. Xe. 


Ww/S 12.5 7.7 33.3 14.3 


Here the variation in the Greek samples is 
greater than that between English and 
Greek; obviously this measure is useless for 
typology (at least in this form), however 
good it may be for stylistics. 


2. Another measure which might be com- 
pared to the prefixing-suffixing ratios is one 
concerned with the number of words in 
constructions which precede their heads, as 
compared with the number which follow. 
This requires some arbitrary decisions in the 
case of so-called exocentric constructions; 
Ihave chosen to consider the preposition as 
head of prepositional phrases (on the ground 
that ‘The master is in the house’”’ has more 
in common with “The master is in” than 
with “The master is the house’’), subordi- 
nating conjunctions as heads of their clauses, 
and verbs or verb-phrases as heads of 
sentences. The best measure appears to be 
that of words before heads to words after 
heads (rather than of these to total words). I 
give the figures here for three Greek and 
three English passages.* 


Hu. 
.27 


Gr. 
.3l 


Gi. 
.29 


Xel 
.49 


Xe V 
.46 


Pl. 


B/A 58 





3The English passages are (1) a 378-word 
stretch from a detective novel by Kyle Hunt, To 
Kill a Killer (New York: Random House, 1960), 
p. 44; (2) a passage of equal length from Green- 
berg’s article (including the 100-word passage used 
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While there is obviously some stylistis 
range here (in both languages), in this cag 
at least, the difference between the language 
is much greater. It is interesting that th 
greatest difference is between Hunt ani 
Plato, since both of these passages involv 
many short, simple, colloquial sentences (iy 
contrast to the other four). 


3. A measure of flexibility of order might 
be a desirable figure to have; the ideal, per. 
haps, if one could calculate it, would be P/§ 
possible permutations divided by the nun. 
ber of sentences. A figure which will d 
almost as well, and is much easier to arrive 
at, may be had by considering each word in 
turn and noting how many other positions 
in the sentence it could be shifted to (if m 
other changes are made), and dividing this 
total M by W, the number of words. If we 
take the Greenberg 100-word passage al- 
ready used, we get a flexibility ratio of .13, 
and for the Gibbon passage .15, while fo 
Plato we have .68 and for Xenophon (1.1) 
.92. If we extend our rules to consider not 
only single words, but also phrases of two 
and of three words (but none longer), we 
may get for Greenberg a ratio of .18, for 
Gibbon .31, for Plato 1.09 and for Xenophon 
1.45. Again the variation within each lan- 
guage seems much less than the difference 
between the two. 


4. Many more measures are probably 
worth testing, though some of them may 
prove to be too bound up with style, rather| 
than language. For instance, at the last! 
Linguistic Society meeting V. H. Yngve ci 
M. I. T. presented an interesting procedure 
for analyzing sentence structure‘ which 
yields for each sentence an integer called 





before), and (3) a 100-word passage from Gibbon’s 
Decline and Fall of the Roman Empire from the 
beginning of Chapter 47. The Greek passages are: 
the first 148 words of Xenophon’s Anabasis, 19 
words from the Anabasis 5.1. 8-11, and the same| 
100 word passage of Plato that was used before.| 

*A Model and an Hypothesis for Language] 
Structure. 
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maximum depth’. It is at least possible that 
fie average maximum depth over a suffi- 
iently large random sample of a language 
jould prove to be characteristic for the 
nguage. A very similar measure might be 
jgsed on the average number of transforma- 
jis required to generate each sentence in 
ach a random sample; this, however, re- 
wires that consistent transformational 
sammars on the same pattern are available 
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be entirely style-free (or mutually inde- 
pendent: Kroeber’s arrangement in rank 
ders shows 3b and 4b with exactly the 
sme order, 1 and 3c extremely similar); 
when I calculate them for the same one- 
hundred word sequence from his own paper 
which we have used before, I get 1.64 for 
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M/W, .35 for A/J, 1.05 R/W, .36 D/W, .22 
I/W, 1.5 P/W and .54 S/W. This suggests 
that M/W (which he had already tested 
with a variety of passages) is indeed fairly 
constant, but D/W and I/W distinctly less 
so; either of these figures would substan- 
tially alter the rank order of languages. 
Probably longer stretches than 100 words 
would help to even things out, but even so 
derivation and compounding (particularly 
in Sanskrit, for instance) can vary consider- 
ably with style. 

So we come to the last of our first 
thoughts. There are probably at least a half- 
dozen reasonably consistent syntactical in- 
dices which can be used for typological 
purposes; like Greenberg’s morphological 
‘indices, all of them (as well as others whose 
value cannot be decided in advance) must 
be thoroughly tested for degree of freedom 
from stylistic influence and for general 
consistency from language to language. 
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0. Scope 

. Density ranges in morpho-syntax 

- Range of stem sequences in text and 

inventory 

Range of affixes and operators in text 

and inventory 

Allomorphic density range 

. Phonemic density range in text and 
inventory 

. Implication of phonological and other 
subsystem typology 

4. Non-binary nature of density ranges 


0. A brief three-stage history of the whole 
language typology problem may set it in 
perspective. Underlying its manifest ‘his- 
tory’, work in typology reflects broad ‘in- 
terests’ or ‘needs’ which are not satisfied in 
structural linguistics, with its emphasis on 
what is unique in a given language. All 
typologies—subsystem as well as whole lan- 
guage—look for what is comparable among 
natural languages, whether related or not. 

0.1. The 19th century linguists included 
ethnocentric considerations in their typol- 
ogy. On the linguistic side, their typology 
was set up under the assumption that some 
morphosyntactic features of a given lan- 
guage will serve as an index or indication of 
the type of that language as a whole. Such 
typology attempts to measure the whole 
language—by implying a correlation be- 
tween parts of the language treated in the 
typology and parts not treated. 

0.2. Sapir excluded all extra-linguistic 
considerations in his typology. He eschewed 
the evolutionary and ranking interests of 
his predecessors, but otherwise followed the 
main logic of whole language typology—a 
logic of metonymy in which a few parts (i.e. 
a low number of morphosyntactic features) 
represent the whole. Like the 19th century 


typologists, Sapir cited features for a give 
language as though the language in questig 
were fully described. From the inventory ¢ 
any of its features—in effect, a verticg 
storage of all relevant features—Sapir woul 
select certain ones for purposes of typol 
gizing. The dozen terms that Sapir used fy 
his features or indices are not equal 
coordinate as such, but are coordinate i 
terms of three axes or bases: formal prom 
esses (isolating, symbolic, affixing by prefiw 
or suffixes or infixes); semantically oriente 
word construction (analytic, synthetic, poly 
synthetic); semantically oriented morpho! 
phonemics (fusing-inflective, juxaposing- ay) 
glutinative). 





Sapir typologized exclusively from a vert 
cal storage source of information for a give 
language (‘lists and rules’), thereby imposing] 
a unidirectional procedure: it became neces| 
sary to structuralize before typologizing :| 
given language—a necessity that must hav 
seemed so obvious as to be not worth mer- 
tioning. But for other considerations, s¢ 
3.2, below. 

0.3. Greenberg did more than quantify 
Sapir. He changed the basis of typology 
from a vertical storage base of information, 
as used by Sapir, to a horizontal or syntag/ 


matic text frequency count. In programming! 


a translation computer, one is faced with! 
much the same kind of problem that Green- 


berg proposes for typologizing. In either 


case, it is possible to program a computer 0 
typologize a given language with one set 0 
texts. 

0.4. The three-stage history of whole 
language typology (0.1-0.3, above) is 9 
isolated from the history of subsystem 
typology, that workers in one development 
do not in general appreciate work in the 
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other.! Whenever new entities become iso- 
lable in structural linguistics, they lend 
themselves to typologizing—from the pho- 
nological typologies of Trubetzkoy to the 
one suggested by Harris for kernel sentences 
in transformational grammar. 

We now attempt to go beyond phonology, 
without assuming that we have, in some new 
or old way, hit upon a short-cut to character- 
iing a whole language through a few salient 
features. When we state the phonemic 
density ranges in languages A... it be- 
comes possible to compare the subsystem in 
question among languages A .. .n. We move 
cautiously from one part of a language 
typologized to another part. If the typology 
of one part implicates that of another part, 
that would be important—but discoverable 
only after preliminary typologizing of more 
than one system across languages. 

0.5. For each part of a language being 
typologized, we have recourse to (1) a text, 
as well as to (2) ‘inventory’. By ‘inventory’ 
we mean: the vertical storage as given in a 
grammar of a more or less fully described 
language. But we wish to typologize lan- 
guages for which full length grammars are 


1For whole language typology, see Kroeber’s 
paper and Greenberg’s paper in this issue of IJAL, 
and the bibliographic references by them. For 
bibliography of subsystem typology, beginning 
with Trubetzkoy, see Charles F. Hockett, Manual 
of Phonology (IUPAL 11 = IJAL 21 No. 4, 1955), 
and subsequently C. F. Voegelin, On Developing 
New Typologies and Revising Old Ones, SJA 
11.355-60 (1955); Six Statements for a Phonemic 
Inventory, IJAL 23.78-84 (1956); Subsystems 
Within Systems in Cultural and Linguistic Ty- 
pologies in For Roman Jakobson, The Hague 
(1956); The Scope of Whole System (‘Distinctive 
Feature’) and Subsystem Typologies, (with John 
Yegerlehner) Word 12.445-54 (1956); Linear 
Phonemes and Additive Components [SGC], Word 
12.429-48 (1956); and with reference to Harris, 
review of Syntactic Structures by Noam Chomsky, 
IJAL 24.229-31 (1958); John Yegerlehner and 
Florence M. Voegelin, Frequencies and Inven- 
tories of Phonemes from Nine Languages, IJAL 
23.85-93 (1957); Henning Spang-Hanssen, Proba- 
bility and Structural Classification in Language 
Description (Copenhagen, 1959). 
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not yet available but which have been 
sufficiently analyzed to permit recognition 
of the overall structure. For such languages, 
still under analysis, we use ‘inventory’ to 
refer to some approximation of a grammar, 
which at a minimum would contain numer- 
ous forms obtained by ancillary eliciting 
from the text cited. Here, (1) text and (2) 
inventory are not taken as independent 
variables; rather, (1) is related to (2) for 
each density range considered. 

0.6. With the exception of the term used 
in the title, ‘the typology of density 
ranges’ uses only those terms already defined 
for structural linguistics. 


1.0. Statements in this section are con- 
cerned with various morpheme classes. The 
assumption is that nouns (N) in one lan- 
guage are typologically comparable to nouns 
(N) another language only in the context of 
the structural basis of each language. Thus, 
since the total number of stem classes are 
three for Hopi, for example, but five for 
Kannada, a translation between these lan- 
guages would show that some morphemes 
which belong to verb class in Hopi would 
translate into Kannada verb class, while 
others would translate into Kannada adjec- 
tive class. So also Kannada morphemes in 
adverb class would translate either in Hopi 
verb class or Hopi particle class, and so on. 
This rather general caveat incidentally 
points up the relevance of typology to trans- 
lation problems. 

1.1. In compounding, the minimum num- 
ber of stems is two, though some languages 
permit more than two stems in one word. 
Where there is only one stem in a word, there 
is no compounding. For range of stem se- 
quence in words, however, compounding, as 
usually identified, is merely one magnitude 
in the range which actually extends from 
zero (as when prefix-suffix sequences without a 
mediating stem are counted as words), to 
one, to more than one stem (thereby com- 
pounding). There exist some languages in 
which the range of stem density is nil be- 
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cause the minimum number of stems and 
the maximum number of stems is exactly 
the same. The lower limit or minimum 
density range for some languages is one 
stem rather than zero stem—that is, for 
languages which have all prefix-suffix occur- 
rences mediated by a stem. The upper limit 
of maximum density range for some lan- 
guages is one stem rather than more than 
one, but for other languages is more than 
one—up to a half dozen or even more. 

The minimax gives the range rather than 
the average of stem density; and it gives the 
range for each particular morpheme class of 
stem as verb (V), noun (N), particle (P), 
and so on, rather than for stems as an undif- 
ferentiated overall class, hereinafter called 
the major morpheme class. 

We want the differentials selected for the 
typology to be sensitive to the differentials 
that obtain in the languages to be typolo- 
gized. If, now, we were to state the minimax 
for stem density in terms of an overall major 
morpheme class, we would obtain a less 
sensitive differential than if we were to give, 
as we do, the stem density ranges for each 
class of stems as N and V and P. For exam- 
ple, in a given language the stem density 
descends from a maximum of three noun 
stems (NNN) to a minimum of one (N) 
while the verb stem density range descends 
from two verb stems (VV) to one (V). If, 
now, N and V are merged as a single major 
morpheme class, the report gives a single 
stem density range (one to three), and 
thereby conceals the differential between N 
and V density ranges. We want to obtain 
density ranges which will reveal such differ- 
entials. 

For certain languages, however, the stem 
density range as given for the text is com- 
parable to the stem density range from the 
inventory only when recursive extensions of 
compounding (possible in the inventory) are 
excluded. We do not, however, exclude men- 
tion of the recursive possibility of such com- 
pounding, since recursiveness is relevant to 
maximum density, but rather include such 
mention in terms of a formula beside the 
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one. Having raised the question of the de- 
sending order of stems from information 
given by the Kannada text sample (N, V, A, 
Adv., NV; NN in the same rank as VV; so 
also AN, VVV in the same rank; and finally 
NNN), we turn to the inventory in Kannada. 

Faced with the inventories of Hopi and 
Kannada, we ask a series of questions, which 
are given in the following paragraphs, to- 
gether with general answers. This shows the 
related uses of the sample text and of the in- 
ventory in respect to stem density ranges. 
The text orients us, or, in other words, sug- 
gests questions. The inventory answers these 
questions, if it is a sufficiently full inventory; 
without the text, the full inventory turns out 
to be so rich in its storage that it leads us 
away from the obvious kind of questions that 
are raised recurrently in texts: 

Is the inventory minimum the same as 
the text for stem density ranges? The answer 
is yes for Hopi, and yes also for Kannada. 

Have some instances of every word class 
appeared in the sample text? The answer is 
yes for Hopi, but no for Kannada. That is, 
the Kannada inventory includes the stem 
class of particles, but the sample text does 
not. 

Has some one or more instance of every 
variety of compounding appeared in the 
sample texts? The answer is no for Hopi and 
no also for Kannada. 

This brings us to the point of stating mini- 
max in stem density ranges. This is easy to 
state from the sample texts. It is also easy 
to state the minimum in stem density range 
from the inventory. However, the inventory 
does not give us a correspondingly easy basis 
for stating the maximum in stem density 
range. We do consider some of the difficulties 
for each of the languages sampled, in subse- 
quent installments of this paper, somewhat 
in the spirit of offering a typology of diffi- 
culties. 


1.2. Parallel to the cover-term ‘major 
morpheme class’ for stem classes, we use the 
cover term ‘minor morpheme class’ for af- 
fixes and operators. In general, major mor- 
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phemes have a higher list count than minor 
morphemes, while minor morphemes have a 
greater text frequency than major mor- 
phemes.? Morphemes from both of these 
overarching classes are typologized with ref- 
erence to particular stem classes. For exam- 
ple, in the case of affixes we do more than 
distinguish between prefixes and infixes and 
suffixes. We distinguish prefixes which ap- 
pear in sequence with one particular stem 
class from those which appear in sequence 
with another class or with either of two par- 
ticular stem classes—and so on. 

Affixes would appear to need no special 
identification. Though prefixes in general 
precede stems and suffixes follow them, se- 
quences of prefixes and suffixes appear in 
some languages without a mediating stem 
(or alternatively, with zero stem)—notably 
in Algonquian; and the same affix appears 
before the stem in some sequences, after the 
stem in other sequences (as the person mark- 
ing ‘ambifixes’ of Hidatsa). In short, the 
usual notion of an affix as a bound morpheme 
with unique position class in respect to a 
stem is not adequate for all types of lan- 
guages. 

We treat operators as coordinate with af- 
fixes, and distinguish four kinds of operators: 
reduplication which is morphemic; phoneme 
replacement which is morphemic;’ subtrac- 


2 This is expressed as a correlation rather than 
as a criterion. If one were to use this statement 
as a criterion for inclusion in minor morpheme 
class, then one would count here certain particles 
in certain languages—Germanic languages with 
their prepositions, etc., for example (Hockett’s 
‘functors’). But by formal criteria alone, particles 
are included with stem classes rather than with 
affixes and operators. 

3 In Hopi, the plural of actor is marked by affix 
for some verb stems and by suppletion for other 
verb stems. Such suppletion is an example of 
maximum phoneme replacement, since there is no 
phonemic resemblance between V2 for singular 
actor (redundantly marked by one noun or noun 
substitute, N, which co-occurs with the verb) and 
V2 for plural actor (marked by another co-occur- 
ring N). The morphemic density would be the 
same whether one states piti singular actor arrives, 
as V;, one morpheme, and ?bki plural actor arrives, 
as V2, two morphemes (verb stem as one mor- 
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tion which is morphemic (Bloomfield’s ‘mi- 
nus feature’); and, finally, word order which 
is morphemic. It is possible to encounter a 
fixed reduplication which is not morphemic 
in some languages (e.g. the verb in ‘He pooh- 
poohed it’). If not morphemic, phoneme 
replacement and subtraction are morpho- 
phonemic, as indicated in fn. 3. So also, when 
not morphemic, transformations in word 
order are distinguished (as allosyntactic free 
alternants) from transformations included 
under operators. 

An operator functions like an affix—.e. 
with reference to a particular stem class or 
word class—and yet an operator is itself not 
an affix. Recognition of the coordinate na- 
ture of affixes and operators is useful in 
typologizing across languages and in struc- 
turalizing within languages. In typology, it 
permits such statements as that goal is 
marked by an affix in one language, by word 
order in another. In structuralizing a given 
language, as Hopi, it permits such state- 
ments as that plural is marked by affix alone 
for some nouns (suffix); by operator alone 
for other nouns (reduplication); and, redun- 
dantly, by both affix and operator for still 
other nouns. 

The next section, 2, gives instructions for 
reading a simple but unfamiliar notation for 
allomorphic density. The same notation 
serves also for indicating morpheme class 
and word class. Where a single stem is in- 
cluded in a word, and where the particular 
class of the stem (as N, V, etc.), is the same 
as that of the word, no special notation is 
used. Thus, the notation N-3 indicates that 
a stem of noun class is followed by three suf- 
fixes, and that the sequence as a whole be- 
longs to the noun word class. We would key 
English hard as A (adjective class stem) 
which remains A for word class; but in con- 





pheme and the operator, maximum phoneme re- 
placement, as the other morpheme), on the one 
hand, or, on the other hand whether one states 
?6ki as a morpheme alternant of piti before 
plural suffix with the shape zero. The latter would 
also yield a density of two morphemes: pit? + 
@ = ki. 
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trast we would key examination as V-1 
noun (i.e., examine is verb stem, followed }y 
one suffix, -ation, with the emerging wor 
class different than the stem class and hen 
specified in the notation), as in the phrag! 
hard examination: A V-1 — noun. Similar} 
we would key harden as A-1 — verb; ands 
on. Whenever two or more different ste 
classes are specified in compound, the not, 
tion indicates the emerging word class.‘ 


2. The allomorphic density of a given lap 
guage is comparable to that of another lay 
guage only when some agreement is reachel 
as to what variations in a language are to} 
included, and what are to be excluded in thy 
comparison. Decision is easy when practic 
considerations are involved. We are agred 
that it would be extremely cumbersome an/ 
impractical to include variations due to pan. 
linguistic influences; pharyngealization ( 
vowels in some kinds of emotional speech i 
an example. Stylistic variations, as for work 
in chants, are also excludable. So also ar 
idiolectic or dialectic differences amoy 
speakers. 

Some typologies go so far as to distinguit 
automatic morphophonemics—variations de 
rivable from a base form by rules—fron 
non-automatic morphophonemics. There i: ‘ Itis 
an interesting procedural difficulty with thi} petween 
distinction. If the first published version | what is ¢ 
Tiibatulabal were typologized, morphem *¢ isom 
alternants involving changes of vowel length} “ition. 
would be counted as non-automatic; the} an 
are described by lists, with one kind of vowél jem in 3 
change peculiar to certain sets of allomorphs) The p 
and another kind peculiar to other sets | se: to 
allomorphs. If the morphophonemic restate| *Y!¢s 4” 
ment of Tiibatulabal were typologized, the ct 
alternants involving changes of vowel lengtl| . 








4 Word classes may be distinguished not onl ond ns 
by morphological criteria, as for the present ex| ea 
amples, but also by tagmemic (or tagmetic?)) oon ia 
criteria, to be discussed in another installment ¢!) F level: 
this paper. So also, reserved for future discussion, distinet; 
are distinctions among affixes, as ‘derivational 
versus ‘inflectional’, and ‘closing’ versus ‘col: 
tinuing’ in strings of affixes. 


same lat 
which ai 


OL. XxXy 


s V-1e 
lowed by 
ing worl 


nd henge 


e phrag! 
similarly 
D; and g 
ent ster 
the nots 
ass.4 


‘iven lap, 
ther lap. 
3 reachel 
are toh 
ed in th 
practic 
re agree 
some ani 
2 to pan 
ation ¢ 
speech i: 
for word 

also ar 
3 amon 


stinguis 
ations de 
es—tfron 
There i: 
with thi 
rersion (i 
1orphem: 
vel length 
tic; they 
| of vowed 
omorphs 
ar sets a 


ce restate! 


ined, the adequate descriptions. We will attempt this for 


vel length 
1 not only 


resent ex 
tagmetic? 


rivational 


rsus ‘Col: 


yo. 3 


would be counted as automatic, since these 
aiternants are derived from special formulae 
or bases (new symbols added, and morpho- 
phonemic values set up beside phonemic val- 
ues for the symbols previously used)—de- 
rived from such formulae by quite complex 
rules. In consequence, the original structur- 
ing of Tibatulabal would yield one morpho- 
phonemic type, while the structural restate- 
ment would yield another type. 

This points up a general danger to be 
avoided if possible—the danger of obtaining 
atypology of descriptions instead of a typol- 
ogy of particular languages. To obtain the 
latter, we want our typology to come out the 
same way, irrespective of whether it is based 
on some first description or some structural 
restatement, assuming of course that there 
isno contradiction between the two struc- 
turalizations (as there is certainly not in the 
Tibatulabal instance).® 

So far as allomorphic density range is con- 
cerned, we obtain such desired uniformity 
in typology by counting in one morphopho- 
nemic category morphemes whose allo- 
morphs are automatically changeable (as 
identified above), and morphemes whose 
allomorphs are not automatically changeable 
and are, accordingly, statable only by lists. 


5 Tt is easier to say what is not in contradiction 
between two successive structuralizations than 
what is contradictory. For example, where the two 
are isomorphic they are certainly not in contra- 
diction. Reference to the earlier morphophonemic 
statement is given in Martin Joos’ Readings in 
Linguistics, 88-92 (Swadesh and Voegelin, A Prob- 
lem in Phonological Alternation, Lg. 15.1-10). 

The problem discussed at this point is a general 
one: to make typology independent of different 





allment ¢! 
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styles and fashions of structuralizing that produce 


morphosyntactic density ranges by merging all 
| classes of morphemes, or those that are distin- 
| guished in non-unique ways, in one density range; 
land we will report our results in subsequent 
‘installments of this paper. Though it has already 
"been said that this is not very interesting, since 
jit levels differentials, it nevertheless permits 
\distinction among different descriptions of the 
‘same language which are isomorphic from those 


which are not. 
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Such ‘changeable’ morphemes are indicated 
by a prime after the symbols for stem or op- 
erator or affix in a particular sequence. In the 
case of English plural, /-s ~ -z ~ -iz/ exem- 
plifies a set of automatic alternants; but 
/-n/ and other allomorphs exemplify the 
non-automatic alternants. Thus, merging 
automatic and non-automatic alternations 
into one general category of allomorphs, we 
would key books, boys and oxen as N-1’, but 
wives and houses as N’-1’. In the first set, 
each noun stem has a ‘unique’ shape (book, 
boy, ox), and is accordingly symbolized by N 
without a prime. In the second set, the noun 
stems have more than one allomorph, and 
hence are symbolized by N with a prime 
(N’). In all instances cited, the plural suffix 
is changeable and hence symbolized by a 
prime on the number denoting how many 
suffixes follow the N’ or N, namely N’-1’ or 
N-1’. When a language has two affixes after 
N, for example, each having more than one 
allomorph, this is indicated by two primes 
after the number giving the sum of affixes: 
2’’. If one of the two suffixes had more than 
one allomorph, and the other had a unique 
shape, this would be indicated by one prime 
after the number giving the sum of the suf- 
fixes: N-2’. 

The preceding exposition is spelled out in 
a very elementary manner because it intro- 
duces an unfamiliar notation for marking 
morphophonemics. We turn now to a pro- 
cedural reservation on some extended uses 
of this notation. 

Though we cover automatic alternation 
by this notation, we do so only when two 
adjacent morphemes are involved—within or 
between words (internal and external sandhi) 
—in the span relevant to the morphophone- 
mics. But longer spans than this are relevant 
to what might be called ‘long span morpho- 
phonemics’—e.g., vowel harmony in Osmanli 
Turkish, alternation of stress in Hopi, and 
alternation of vowel loss in Kannada. Should 
we use our morphophonemic notation where 
many of the allomorphs in a long word (as in 
Turkish) or in a contour bounded by pause 
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juncture (as in Hopi and Kannada) are 
statable in terms of a span longer than adja- 
cent morphemes? We think not. In long span 
morphophonemics, the allomorphs of several 
successive morphemes can be stated at one 
stroke by indicating the onset and direction 
of the alternations. When taken individually, 
each morpheme may be found to have allo- 
morphs, while the successive changes really 
constitute a single alternation over the whole 
span. Where such long span morphophone- 
mics obtains, we count it as just one change 
for the whole span rather than as the sum of 
all the particular changes. 

Thus a Turkish verb stem followed by a 
half dozen suffixes, all of them participating 
in a vowel harmony alternation, would be 
noted as (V-6)’. Hopi alternation of stress 
and Kannada alternation of vowel loss is 
given by rule rather than by special nota- 
tion. 


3.1. Phonological subsystems have been 
sufficiently explored (see bibliography in fn. 
1) to require no summary here. Density 
ranges can obviously be stated, in terms of 
the ordering principle followed in listing 
phonemes (as in our Guide to Transcribing, 
or as in Hockett’s Manuwal—obstruents vs. 
sonorants vs. others for the latter; or the 
more conventional distinction in the former 
—stops vs. fricatives vs. nasals vs. liquids 
vs. semivowels). Some languages show a 
denser population of one kind of consonant 
or another. So also for vowels, the density is 
given according to the ordering principle of 
rounded-unrounded, front to back, high to 
low, where the inventories of particular lan- 
guages are compared. So also, Series Gener- 
ating Components (SGC) can be stated in 
terms of density ranges. Other unit pho- 
nemes, as junctures, can be stated as density 
ranges in respect to the length of span be- 
tween pause junctures. 


3.2. There is no doubt that once a number 
of related or unrelated languages have been 
structuralized, typological comparabilities 
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can be obtained for various purposes, ng 
only in phonological but also in other density 
ranges. Such typologies would be relevant tj 
translation problems, as already indicated 
and also to prediction in historical change, 
to distinguishing continuity and disconti. 
nuity in the changing structure of a languag 
family (i.e., accounting for the analogous a 
well as the homologous features in compar 
tive grammar), to the language learning! 
problem and psycholinguistics in general, ti! 
the nature of language or the ‘univers 
grammar’ problem, and finally to anthropo’ 
logical linguistics. 

The central interest of anthropologicd| 
linguistics is in the investigation of relatively 
unknown, unwritten, or unstructuralize 
languages. Its impact on linguistic theory 
and practice in general has been uneven. I 
has led us to look for the structure of ea¢ 
language in its own terms, with the implica! 
tion of thereby discovering its uniqueness, 
But its discovery procedures have been uti 
lized rather than explicitly stated. Hence i 
is now often said that they are unsystematic;) 
and it is further predicted that they are in| 
capable of being systematized. 

If discovery procedures are to be systems-| 
tized, the systematization would begin not 
with the uniqueness of structures, but in the 
framework of comparability that typology 
offers. It has already been assumed that i 
phonemicizing, it is most efficient to work i1| 
a context of possible phonemic distinction 
derived from a knowledge of the language’ 
of the world.§ Typology would help to mak 
explicit this assumption, and thereby might 
lead to a systematization of discovery pre 
cedures—in morphosyntax as well as il) 
phonology. 





4. Throughout these density ranges, wt 
are not using a yes-or-no basis for typologiz 
ing. We select, as the basis for density ranges) 
only what is actually known as occurrin 
with greater or less density, in all natur) 


6 See p. 3 ff. and fn. 2, p. 28 in Guide to Trat} 
scribing (Anthropological Linguistics 1.6, 1959) 
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languages. When the minimum of occurrence 
is stated as zero, the statement is relevant to 
certain sequences in a language which also 
shows greater than zero in other parts of the 
range in question. The density ranges dis- 
cussed in this paper illustrate this. 

Every language known to us includes 
stems in some sequences, though some words 
in some languages occur with zero stem. 

So also, every language known to us in- 
cludes some affixes, but here the density 
range varies enormously. The occurrence of 
particular classes of affixes, as suffixes, may 
be zero in reference to all words of verb 
class, as in Navaho; but there still is a low 
density range of suffixes in Navaho words of 
noun class. Conversely, the occurrence of 
prefixes is almost zero in Eskimo, but not 
quite; there is one prefix in Eskimo.’ 

Instead of saying that there are languages 
without any prefixes, we give the density 
range of occurring affixes (suffixes) with zero 
for the prefix inventory, and hence zero for 
all text occurrences. In consequence, we 
avoid speaking of a prefix type of language 


7The prefix is tak- in sequence with stems 
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Swadesh, South Greenlandic (Eskimo), VFPA 
6.36 (1946). 
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or of a suffix type of language. Instead, den- 
sity range typology compares languages in 
terms of the paucity or richness of affixes in 
respect to stem classes. 

If, now, a language were found without 
any affixes, it would not be comparable to 
the languages actually compared in density 
range typology. Parallel statements are pos- 
sible for operators. 

This requirement—that some part of a 
density range have actual membership— 
serves as the criterion for typological utili- 
zation or non-utilization in phonology too. 
The usual distinction between vowels and 
consonants can of course be utilized, since 
all known languages will show actual occur- 
rence of each. 

Morphemes are phonemically  distin- 
guished by tone in some languages, but not 
in others. Rather than setting up a type of 
tone language versus a type of toneless lan- 
guage, density range typology would set up 
tone as a particular kind of series generating 
component (SGC). It seems safe to say that 
every language will include some SGC. In 
one language the SGC will be tones, in an- 
other stresses, in another nasalization—com- 
binable with vowels and thereby generating 
unit phonemes. 








SIERRA POPOLUCA MORPHOLOGY 


Bren ELson 


SuMMER INsTITUTE OF LINGUISTICS 


0. Introduction 

1. Word inflection 

2. Person markers 

3. Verb inflection 

4. Verb derivation 

5. Substantive inflection 
6. Substantive derivation 
7. Numeral inflection 

8. Participle derivation 
9. Particle derivation 


0. Sierra Popoluca' has four INFLEC- 
TIONAL SYSTEMS and one inflectional sub- 
system. A sysTEM consists of a group of 
affix morphemes having roughly the same 
privileges of occurrence. One inflectional 
system occurs (potentially) with all words 
except a few particles, the second occurs 
with verbs, the third with substantives, and 
the fourth with numerals. Four root classes 
are distinguished: verb roots, substantive 
roots, particle roots, and numeral roots. 
Depending on their occurrence with the 
inflectional systems there are five sTEM 
classes, but including internal composition 
as an auxiliary criterion six stem classes are 
distinguished: A-particles, particles, verbs, 
substantives, numerals, and participles. 

Using other criteria it would be possible 
to arrive at still other classifications (for 
example, using participle derivation as a 
criterion three stem classes are obtained: 
verbs, nonverbs, and A-particles), but the 

1This paper is a shortened version of the 
author’s doctoral dissertation Sierra Popoluca 
Morphology, Cornell University, 1956 (to be pub- 
lished in Spanish translation by the Universidad 
Veracruzana, Jalapa, Ver., Mexico). Other publi- 
cations on Sierra Popoluca are: Elson, Ben, Sierra 
Popoluca Syllable Structure, IJAL 13.13-17 (1947), 
Foster, George M. and Mary Foster, Sierra Popo- 


luca Speech, Smithsonian Institution, Institute of 
Social Anthropology, Publication 8 (1948). Pp. 45. 


six stem classes listed above seem to be the 
most useful and to correspond most closely 
with important syntactic clssses. 

The large classes of substantives and 
verbs are broken down into subclasses 
depending on their occurrence with inflec. 
tional subsystems. In making these sub- 
classifications the person markers play an 
important part. 

There are three DERIVATIONAL systems: 
morphemes used to form verb stems, to 
form substantive stems, and to form par- 
ticiple stems. 

As indicated above we distinguish be. 
tween derivation and inflection, i.e., be. 
tween stem formation and word formation. 
However, the line between these two is 
difficult to draw. Our procedure has been to 
decide more or less impressionistically which 
elements constitute derivation and which 
inflection, then to propose a definition of 
the difference between the two and adhere 
rigidly to the definition. Such definitions 
will be given for each stem class separately. 

Inflectional morphemes, in _ principle, 
occur with all of the stems in the class which 
is defined by that inflectional system. Hovw- 
ever, many combinations that should be 
possible have not been observed; undoubt- 
edly many simply do not occur in the 
language. As Wonderly? suggests, thes 
“combinations do not occur because of 
limitations of the general grammatical 
structure and because of restrictions due to 
semantic or cultural context.’”’ We have 
tried to indicate where, we feel, the restric- 
tions are clearly due to “the general gram- 
matical structure.” It is obviously im- 
possible to indicate, in a work of this scope, 


2 Wonderly, William L., Zoque I: Introductio 
and Bibliography, IJAL 17.3 (1951). 
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eases Where the nonoccurrence of a given 
combination is due to semantic factors. 

Any single-morpheme utterance consists 
of an A-RooT. Different kinds of A-roots are 
determined by different privileges of occur- 
rence. Morphemes which parallel A-roots 
in most of their privileges of occurrence but 
have not been observed to occur as single- 
morpheme utterances are B-Roots. There is 
no essential difference between A-roots and 
B-roots other than that given, we therefore 
speak simply of Roots. 

Any morpheme that is not a root is an 
arrrx. As indicated above affixes are either 
inflectional or derivational. A single root, 
or a sequence of root plus other morphemes 
which may occur with inflectional affixes is a 
steM; derivational affixes occur with roots 
or stems to form other stems. A worp is any 
stem which occurs in an utterance with or 
without inflectional affixes, or is any root 
which occurs in an utterance to which no 
afixes may be added. It would also be pos- 
sible to define a word as any morpheme or 
morpheme sequence with which either a 
word suffix (see 1), or no suffix at all, may 
occur. 

There is a small class of particles that do 
not occur with any affix of any system. We 
call these A-PARTICLES. The following is a 
representative list: id’ak action not in 
progress, yakiii almost, just now, wAi**p 
may, might, ma?-m already, really, mu time- 
subordinate clause indicator, yili probably, 
teh let’s go!, hére that. 

Others are borrowings from Spanish: 
ni neither, nor, i and, éntre and, kon with, 
porke because, péro but, kasi almost, donde 
from, dénde from then, there, kasi almost, 
para for, de of, from. 


1. Except for the small class of A- 
particles, the affixes of the word-inflection 
system occur with the members of all other 
classes in the language. There are two 
classes that occur only with word-inflection 
affixes: particles and participles. These 
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classes are distinguished from each other 
by different methods of stem composition. 

The affixes of the word-inflection system 
are: 

Order I: -nam still, -am/-m* now, just 
now, -t¥im‘ also, just; as in anAkpatYim I’m 
going also, yo-yat¥im just a pig, d¥At¥im also 
no!, he?m éywK?:pt¥im the one who also 
went, pA-Si?fiam now a man, hésagam just 
then, anAkpam I’m going now, cA‘Sifiam 
still a child, d¥4nam not yet, n&kpanam he 
is still going. 

Order II: -wey I said, -un he said, -hak 
Oh, I see, -kip you see;> as in Simpawey a 
dog, I said, anAkpawey I’m going, I said, 
hi?-wey yes, I said, d¥4?un no, he said, 
&tébedun it” it is in Soteapan, he said, 
iki’dun s&'mii he ate bananas, he said, 
yoyahak 7t’s a pig, J see, mffipamhak he’s 
coming now, I see, d’agip t¥i: you’re welcome, 
wigip okay, fine, sk?*pkip now, you see, 
mifipakip he’s coming, you see.® 


2. The person marker subsystem of in- 
flection occurs with both verbs and sub- 
stantives. Its range is broader than any 
other inflectional system except word in- 
flection. There are three sets of person 
markers. Set I: a- first person, ta- inclusive 
person, mi- second person, (third person 
is unmarked); Set II: an- first person, tan- 
inclusive person, iii- second person, i- third 
person; Set III: man- first to second person, 
an- second to first person. 


3-am occurs after consonants, -m occurs after 
vowels. 

‘It is possible that -tYim could be analyzed as 
-t¥i (see 5.1.) plus -am. However, the fact that 
such forms as wifit¥it¥im before that, first are pos- 
sible, where both -t¥i and -t¥im occur, and the fact 
that -t¥i does not occur with verbs (unless it is a 
part of -tYim) suggest that they are two different 
morphemes. The meanings, however, are similar; 
there is undoubtedly an historical connection be- 
tween them. 

5 -kip has an alternant -gip after monosyllables 
ending in a vowel. 

6 There is also a form -can that occurs in one or 
two set phrases before -t¥im: wadpcant¥im uSan 
he can aiso do quite a little. 
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All three sets’? occur with both verbs and 
nouns (nouns are the largest subclass of 
substantives—other subclasses of substan- 
tives are determined mainly by the way in 
which they occur with the person markers). 

In addition to person, every occurrence 
of a member of Set I marks PARTICIPANT, 
every occurrence of a member of Set II 
marks ASSOCIATE, and every occurrence of 
a member of Set III marks both participant 
and associate. If either first or second person 
is involved it will be marked; third person 
is marked only if no first or second person 
is present. 

The participant is defined as the person 
most closely connected with the action or 
situation; the associate is one step removed 
from the action or situation. Set I, marking 
participant, appears (a) with nouns: apA‘sifi 
I am a man, tay6"mo you and I are women, 
mitA‘Si you are a child, kAwah or he? kawah 
it is a horse; (b) with intransitive verbs: 
anikpa I’m going, tahé-ypa you and I take 
a walk, mimifipa you come, wi?kpa he eats; 
(c) with transitive verbs plus -ta: indefinite 
performer: ako?cté:p I am hit, someone hit 
me, takwi?kta:p you and I are fed, mi?a?- 
mtd‘p someone is looking at you, kutta:p 
someone is eating it; (d) with reflexive stems 
(these must also be inflected with -ta: 
indefinite performer): anak6é?-ctéyp I am 
hitting myself, miniSé-nta-p you are hitting 
yourself (with your fist), tanatA-nta-p you 
and I are chopping ourselves, nakkaé?-ta:p 
he is killing himself. 

The participant may also be marked 
syntactically: ko?cté-p Siwan someone is 

7™There are obviously possibilities of further 
analysis (or subanalysis): a- first person, t- inclu- 
sive, mi-/i-/m- second person, (i- occurs with asso- 
ciate, m- with first person, mi- elsewhere), third 
person unmarked, -n-/i- associate (i- occurs with 
third person, -n- elsewhere); when associate is not 
present participant is indicated. 

Set III can be reconstructed morphophonem- 
ically as mi- + an- > man-, and a- + ifi- > an-. 

Although the above is possible, since there 
appears to be little gain in analytical insight, we 
feel the best description results from the three 
sets presented. 
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hitting John or John is being hit, aé apA-Si 
I am a man, nkkpa Siwan John is going, 
natA-pta’ Siwan John chopped himself. 

Where both participant and associate 
may be marked syntactically either the 
participant or the associate may be marked 
by the person prefixes. First or second per 
son, in either category, takes precedence 
over third person; when both first and second 
person are involved set III is used. Such 
situations obtain with transitive verbs in 
both main and subordinate clauses: 

(a) the participant is marked: ak6é?cep 
he hits me, také?cpa he hits you and me, 
mit’éypa he loves you, miséppa Siwan 
mimié John hits you with his fist, k6?¢ it go 
bruised: (b) the associate is marked: ank6’. 
epa I hit him, tank6?cpa you and I hit him, 
mié inkdé?epa Siwan you hit John, ik6?ep 
he hits him. 

Set II, marking associate, appears (a) 
with nouns: antAk my house, tankdwah 
your and my horse, ink&:ma your cornfield, 
ih&-tuy his father; (b) with intransitive 
verbs in subordinate clauses: mu annik 
a’niS ci: when I went I saw a snake, ds 
wa?4'p ifAk he cannot go; (c) with transitive 
verbs plus -ta: indefinite performer in sub- 
ordinate clauses: mifipa iko?cta: someon 
will come and hit him. 

In subordinate clauses the use of set II 
with verbs where set I would otherwise lt 
expected, appears to indicate that the mean- 
ing of ‘associate’ may be extended to includ 
‘subordination of construction’ in most 
situations. 

Set III, marking both participant ani 
associate, appears (a) with transitive verbs: 





majké?cpa I am hitting you, apké?cpa you 
are hitting me; (b) with nouns: manhatu} 
you are my father, a?na-pa I am your mother. 


3. Any SP stem which may occur with) 
suffix -pa incompletive is a verb stem. Ver] 
stems are divided into transitive or intrans:- 
tive by virtue of different inflectional poss: 
bilities. Some stems participate in all the 
inflectional possibilities of either class, 
and are considered members of both classes} 
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Inflectional differences between transitive 
and intransitive stems occur at three points 
in the inflectional system: (a) different 
person markers are used; (b) number 
suffixes function differently when occurring 
with intransitive stems than when occur- 
ting with transitive ones; and (c) different 
indefinite performer suffixes are used. 

3.1. There are two orders of inflectional 
prefixes and six orders of inflectional suffixes. 
These affixes show certain relationships 
among themselves which will be mentioned 
as they are discussed. 

Prefix order I contains a single prefix 
wiga- together: wadgaoyya4h they went to- 
gether, miwdganakté?mpa kon pé:toh you 
will go with Peter, aywagako?cté?mpa we, 
together, hit him. 

Prefix order II contains the three sets of 
person markers (2 above). 

Suffix order I contains two number 
suffixes: -ta?m/-ta® first and second person 
plural, and -yah third person plural. With 
intransitive stems -ta?m/-ta occurs with 
firs; and second person prefixes, -yah 
occurs with the bare stem (or with the 
third person prefix in subordinate clauses) 
marking plural participant: anakté?mpa 
we (excl.) go, tawit’t’4?mpa we (incl.) walk, 
mipoyt’é you ail ran, kekydhpa they fly, mu 
ikekyAh when they flew. With transitive 
stems -ta?m/-ta occurs with first and second 
person prefixes to mark plural participant 
or plural associate depending on which is 
indicated by the person markers; with Set I 
plural participant is marked, with Set II 
plural associate: anko?cté?mpa we (excl.) 
hit him, ako?cté?mpa he hits us (excl.). 
With transitive stems -yah occurs with first 
and second person prefixes of Set II marking 
plural third person participant; it indicates 
plural third person associate when occurring 
with the person prefixes of Set I. When -yah 
occurs with the third person prefix of Set 
II, it ambiguously marks plural participant 


§ -ta occurs word final, -ta?m occurs elsewhere 
(but see fn. 15). The glottal catch is lost when the 
sufix is unstressed; this is also true of all mor- 


phemes. 
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or plural associate or both: anko?cyahpa J 
hat them, ako? cyahpa they hit me, iko?cy&hpa 
he hits them, they hit him, they hit them, 
mu iké?cyahta: when someone hit them. 

Order II contains two suffixes: -ta: 
indefinite associate and -na'm indefinite 
participant. -ta* occurs only with transitive 
stems; its function is to cancel the possi- 
bility of indicating the associate. When the 
verb is in a main clause only Set I person 
prefixes occur; when the verb is in a sub- 
ordinate clause only Set II person prefixes 
occur: ak6é?ctamta: someone hit us, ku?tté- 
wom someone ate it, mu iké?cta aktAy 
when someone hit him he fell. -ta: is obliga- 
tory with reflexive stems (see 4.1): anak6é?- 
‘eta:p I am hitting myself. 

The suffix -na‘m indefinite participant 
occurs only with intransitive stems and no 
person marker is present; its function is to 
cancel the possibility of indicating the par- 
ticipant. Such constructions have not been 
observed to occur in subordinate clauses: 
waé'nni‘mpa® someone is singing, wi?- 
‘kyahni-mpa people are eating. 

Order III contains a single suffix -gak 
repetitive which may occur with both transi- 
tive and intransitive stems: ak¢intagakum 
someone bathed him again, ndkyahgdkpa 
they are going again. 

Order IV contains two suffixes: -to de- 
siderative and -ta?p frustrative which occur 
with both transitive and intransitive stems. 
However, -to occurs only when -pa (see 
Order V) is present and -ta?p only when -pa 
is not present:!° nakt6?:ba" he wants to go, 
is about to go, hé-ygakt6?-ba he wants to go 
for a walk again, tanki?ttamté?:ba we 
(incl.) want to eat tt; a?dyt¥amti?p we 

9 The vowel of the root which precedes the suffix 
-na'm is lengthened when the root morpheme has 
the phonemic shape CVC (where V is any short 
vowel nucleus). 

10 Actually -to and -ta?p could be considered 
allomorphs: -to occurs with incompletive, -ta?p 
occurs with completive. However, we prefer to 
treat them as separate morphemes, admitting 
that the preference is probably arbitrary. 

11The morphophonemic sequence -to + -pa 
yields the phonemic sequence -t6?-ba. 
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(excl.) went in vain, akkd?-yahtA?p they 
tried to kill me but failed. 

Order V contains three suffixes. One 
suffix -pa/-p! zncompletive occurs with verb 
stems (both transitive and intransitive) in 
main clauses or in clauses introduced by 
the particle iga: ahé-ypa J take a walk, I will 
take a walk, miwi’kpa you eat, you will eat, 
it’Apyahgakté?*ba they want to chop it 
again, nanakta:p someone takes him. 

Two suffixes occur with verbs in sub- 
ordinate clauses: -pa‘m incompletive sub- 
ordinator and -wa‘m completive subordinator. 
When these forms occur with intransitive 
stems Set IJ person markers are used. 
Examples: imifip4'm ip4?t ti-m uspifi while 
coming along he found an alligator, tog6yum 
hémak antA?*bapé’m he got lost there while I 
was fishing, ammift’anwi'm akcA?yt¥a 
tuih when we came, the rain caught us, hésak 
iko?ewh'm kit® ikA then when he hit him he 
broke his hand. 

Order Vi contains one suffix -um/-wom"™ 
definite completive which follows -wa‘m of 
Order V:" sé‘tum he returned, iiAknewA-- 
mum after he went. 

Two modal suffixes occur which are 
mutually exclusive with those listed in 
Orders IV, V, and VI: -if obligatory and 
-A/-O'© imperative. Constructions in which 

12 -py occurs optionally (a) after -to (anakté?p 
or anakté?-ba I want to go, am about to go), and (b) 
after a suffix or dissyllabic stem ending -V(?)y 
(iku?dé?p or iku?ddé?ypa he eats the other’s (food), 
hé-m tagA?p or hé-m tagdypa there he is going in). 

18 The two forms occur optionally. Some speak- 
ers prefer one form and some the other, though 
most speakers use both. -wom is the preferred 
form in slow speech. 

144-ym is undoubtedly a combination of *-u 
completive (cf. Wonderly, 1951, p. 141) and -m 
now, just now (see 1) which has frozen as a single 
form. 

16 -Q indicates that there is no overt affix pres- 
ent. It occurs optionally when a stem contains 
-a?y (see 4.1.) and/or -ta?’m: amdcayt¥4?-ma or 
amdcayt¥am You (pl) grab it/them for me! 

The vowel immediately preceding a is always 
lengthened if possible (i.e. unless the preceding 
vowel is followed by a consonant cluster): pet + 
-A > péta sweep it! but poks + -a > pdksa 
double it (i.e. corn) down! 
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-Ii is present occasionally occur as inde. 
pendent syntactically, but are more fre. 
quently preceded by the particle iga that, 
in order that: andgi?fiam I should go nou, 
ik6’¢in he ought to hit him, ifk?ma?y igg 
mi?’niii he told him that he should come, 
iSunpa iga inindgii he wants to take it. 

Verbs containing -a/-0 imperative ar 
always independent constructions. Every 
occurrence of -A/-0 with intransitive stems 
indicates second person participant; with 
transitive stems it indicates second person 
associate; a first person participant may be 
marked in addition: pé-ya run!, mifit’4?*m, 
all of you come!, nakga'ka go again!, k6?-c, 
hit him!, ko?tté?*ma all of you hit him’, 
ko?cyaé:ha hit them!, nak6é?-cta:ha hit your. 
self!'6 

3.2. There are two kinds of verbal aux. 
iliaries which should be included in a treat. 
ment of verbal inflection. 

One type of verb phrase is characterized 
by the occurrence of a single set of inflee. 
tional affixes with the verb complex (aux. 
iliary stem plus main stem). There is a 
small group of intransitive verbs that occu 
both as auxiliaries and as single intransitive 
verbs: nak to go, mifi to come, oy to go ani 
come, moh to begin, yah to finish, ha?y w 
prolong, kus to be satisfied. Each of thes 
roots may also occur with -ne (see 4.1), 
There are two verbal particles whose fune- 
tion parallels that of the verb roots above 
except that they are uninflected: teh com: 
on!, ta?*mu let’s go! 

Inflectional affixes occur as follows (the 
auxiliary verb always precedes the main 
verb): prefixes and suffixes of Orders I ani 
II occur with the main verb, Orders IV, \, 
and VI, -ifi, and -a occur with the auxiliary 





verb. When the main verb is transitive -1| 
occurs with both the auxiliary verb and the 
main verb. There is a suffix -i” intransitiv' 


16 The root vowel is lengthened when the re 
flexive stem occurs with -ta (see 4.1.). 

17 -j is not considered a regular part of verb in- 
flection, discussed in 3, because it is limited (with 
a few exceptions) to auxiliary constructions. W 
view it as being more closely related to the deriva 
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main stem indicator which is obligatory with 
intransitive main stems (preceding Order I 
suffixes): nAk i?a?my&h they went to see it, 
mifit’6?*ba ma?na?mgék I want to come 
and see you again, dyt¥A?p wi?:ki he went 
there to eat in vain, mohgékum iwdt he 
began to make it again, mi‘iia mikovfii come 
and sit down!, n&-ka pA‘ka go and get it!, téh 
tahé:yi come on for a walk’ 

A few verbs of motion, nak to go, mifi to 
come, set to return, have been observed to 
occur with -i (see above) even though the 
auxiliary verb is not present: mf-fii tunné"m 
he is/was coming in the road, niki &:tébet 
he is/was going to Soteapan. 

In certain verb complexes when the main 
verb is a transitive stem, the derivational 
prefix ku- (see 4.3.) and the imperative 
suffix -a (see 3.1., 6.1. and ef. -i discussed 
above) occur with the transitive stem. The 
auxiliary verb is inflected as described above. 
The meaning of this construction is similar 
to the meaning of the constructions where 
-ta’ indefinite associate is present: nAkpa 
ikumé?-ca someone goes to look for him. 
Only a few instances of this combination 
have been observed. 

When the suffix -ta: indefinite associate 
occurs with the main stem in a verb complex 
the person prefixes of Set II occur (as they 
do when this construction is in a subordi- 
nate clause): nAkpa i?a?mta&: someone goes 
to look at it or someone goes and it is looked 
at, mifipa tanko?ct&é: someone comes to hit 
you and me, but mifipa také?c he comes to 
hit you and me. 

The second auxiliary verb phrase type 
contains a single auxiliary stem sa to be 
about, to walk. It is inflected with Set II 
person markers only, and follows the main 
verb which is fully infiected. The person 
marker occurring with sa must match the 
one that occurs with the main verb stem: 
andkpa ans I’m going along now, anké?cpa 
ansk I’m hitting him now, wi?kpa iSK he is 
eating now. 





tional (see 6.1) than to the inflectional system, but 
as an integral part of neither. When -i occurs the 
preceding vowel is lengthened if possible. 
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4, There are three criteria used to dis- 
tinguish SP verb derivational affixes from 
inflectional ones: (a) any morpheme which 
may occur immediately following the re- 
duplicative morpheme (4.3) is derivational; 
(b) any affix which may occur with a root 
in such a way that the two together (root 
and affix) form an immediate constituent 
with another root, the whole then forming 
a stem, is derivational; (c) any affix which 
may occur in more than one linear arrange- 
ment within words, relative to roots and/or 
other affixes, is derivational. Any affix 
meeting any one of these criteria is con- 
sidered derivational in all of its occurrences, 
though in other respects it may function 
like an inflectional affix. 

A stem which is also a root is a simple 
stem; a stem which consists of more than a 
root is a derived stem. Two basic methods 
of derivation are recognized: affixation and 
compounding. In some complex stems these 
two methods may overlap or be mixed 
together in a complicated way. 

Stems are divided into three classes based 
on the different privileges of occurrence of 
the morphemes which comprise derived 
stems. Thus, some affixes occur only with 
verb roots (with a very few exceptions), 
others occur only with substantives and 
particles to form verb stems which, in some 
ways, then function like verb roots. We call 
these forms, including roots themselves, 
INNER FORMATIONS. Inner formations occur 
together to form coMPpouNDS. Compounds 
may also be formed by combining verb 
roots or stems with substantives or particles. 
Finally, either inner formations or com- 
pounds may occur with still another group 
of affixes to form new stems; these we call 
OUTER FORMATIONS. 

4.1. Seven affixes may occur singly or in 
combinations with inner formations or com- 
pounds to form stems: -ne durative, -a°y 
benefactive-indirective, -ka instrumental, -o?y 
indefinitizer, ak- causative, na- associative- 
causative, nas- reflexive-reciprocal. 

All stems in which one of these affixes 
is an IC are said to be outer formations; the 
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affixes are outer affixes. Occasionally, these 
affixes function like inner affixes in that the 
resulting stem may occur in compounds. 

The usual arrangement of outer affixes, 
both linear and hierarchical, is indicated in 
the following chart. Numbers indicate the 
usual order in which ICs are obtained by 
successive cuts. 


2 6 5 4 3 1 
nao- ak- Stem -o?y -ka -a?y -ne 
na- 


-ne/-ni'® durative!® usually functions like 
an inflectional suffix. It may occur with all 
verb stems, and is usually followed by no 
aspect suffix at all or by -um definite com- 
pletive indicating that the action of the stem 
was accomplished in the past, but that the 
results continue into the present. It may, 
however, also be followed by -pa incom- 
pletive: nakné he has gone, iku?tné he has 
eaten it, iko?nné?-ba he is carrying it along 
on his shoulder, ikA-nkanké-fikané he has 
sat on it down on it fearfully, n&kkpa i?4?- 
mamuné he is going to look at it over and over. 

nas- reflexive-reciprocal occurs with a large 
number of transitive roots and stems. When 
nae- is present such stems are termed re- 
flexive stems. When a reflexive stem occurs 
with number suffixes it is reflexive or re- 
ciprocal. Reflexive stems are always in- 
flected with -ta: indefinite associate. The 
difference between nae- and na- associative- 
causative is that when nae- is present the 
accompanying root contains a long vowel. 
Compare the first two illustrations. natA-nta: 
he cut himself < tan to chop something, 
natanta: (iké-ma) someone cut (his corn- 

18 -ni occurs unstressed, -ne elsewhere. 

19 This suffix was considered inflectional by the 
Fosters. Actually, it could be handled in any of 
three ways: (a) as a derivational affix (it is so 
treated here) which has very wide distribution 
paralleling the distribution of inflectional affixes; 
(b) as an inflectional affix (as treated by the 
Fosters) which has wider privileges of occurrence 
than do other inflectional affixes, paralleling in 
some occurrences derivational affixes; (c) as two 


different morphemes, one inflectional and one 
derivational. 
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field) with him < tay to chop something, 
anak6é?-cté: I hit myself < ko?e to hit 
something, minimaéctamta:p you are grabbing 
each other < mac to grab something, nasé- 
pyahta: they hit each other with their fists < 
son to hit something with the fist. 

-a’y benefactive-indirective occurs with 
most transitive stems and a number of in- 
transitive ones; the resulting stems are 
transitive (with a few exceptions). In gen. 
eral, three functions of -a?y may be dis. 
tinguished: 

(a) Along with other outer affixes -ay 
occurs with some roots which are not stems 
in order to form stems. 

(b) When -a?y occurs with some intransi- 
tive stems, usually transitive stems are 
formed: ifia?m4?ypa he says to him < nam 
to say, iha?y&?ypa he speaks to him < hay 
to speak, it¥égo?ya?y he lost it < togoy t 
be lost, destroyed. 

(c) When -a?y occurs with transitive 
stems, it indicates that the action expressed 
in the stem has reference to or a connection 
with someone or something that is not the 
direct participant in the action. This ind: 
rect participant, if non-third person, is 
marked by the person prefix, and otherwise 
may be understood from the cultural con- 
text, or from the non-immediate linguistic 
context, or may be marked syntactically in 
the same clause with the verb. The following 
statements describe the formal features 
relative to the occurrence of -a?y. (1) When 
the person marked by the person marker 
(participant class) is not the direct par- 
ticipant -a?y must occur. The direct  par- 
ticipant may be any person. (2) When the 
syntactically marked direct participant 
(noun) is inflected with a possessive person 
marker which does not refer to the associate, 
-a?y will be present in the verb stem. 
Examples are: iw-cdkpa pwé:nteh, ipaga?- 





ypa cdy he fixes the bridge, he gets vine for 
it < pak to get something, inkéhay wé?-fiak| 
he showed wasps to him < ankeh to exhibit! 
something, nanXgayt¥a hikta someone took’ 
fire to him < nanak to take something, 
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&é?nay Siwan ipé?kpak he stabbed John’s 
ribs < cey to stab something, nAkpa mam- 
mé?ca?y imma‘nak I’m going there to search 
for your child < me?c to search for some- 
thing, (but) ndkpa ammé?c ammé4:nak I’m 
going there to search for my child, it¥’Aqna?y 
ikX he cut the other’s hand < tay to chop 
something, (but) itYAn ikA he cut his own 
hand. 

Occasionally, -a?y occurs twice in a stem. 
This indicates a second indirect participant: 
anankga’ya?*ya take it to him for me < 
nanAk to take something, it%éba?ya?y ikuéi-- 
yuh he took his knife away from him < top 
to take something out, ihh?-ga?yA?ypa 
ikAwah he leads the other’s horse for him < 
ha'k to lead something. 

-ka instrumental occurs primarily with 
intransitive stems; the resulting stems are 
transitive. It generally indicates that the 
participant is the means, or instrument, by 
which the action indicated in the stem 
(without -ka) is performed. Often basically 
transitive roots occur with -o?y; then this 
new stem occurs with -ka: it’A?noyka?*ba 
he chops with it < ta?no*y to be chopping, 
antéfiké?*ba I stand on it < te'fi to stand 
up, imA*tké?-ba he plays with it < ma‘é to 
play, anki?nk& I got up on tt < ki?m to 
go up. 

-0?y indefinitizer occurs primarily with 
transitive stems; the resulting stems are 
intransitive. It serves to take the emphasis 
from the ‘object’, implied in a transitive 
stem, and place it upon the action of the 
stem: ydéSpa?dé?ypa he will help, he is 
helpful < yo:Spa?t to help someone, cagé? ypa 
it moves < cak to touch something, tuhdé?ypa 
he will shoot, he does lots of shooting < tuh 
to shoot, shoot something. 

ak- causative occurs with a large number of 
stems; with intransitive stems the resulting 
stems are transitive; with transitive stems 
a second participant is added: iknAkpa 
he sends him < nak to go, ikkdé?cpa he 


thing, ikA?-ba he kills it < ka to die, ikkuwe?- 
ntAkpa he causes it to flash up < kuwe?ntak 
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to flask up, ikwA?kpa he questions him < 
wa’k to ask for something. 

na- associative-causative occurs with both 
intransitive and transitive stems; the 
resulting stems are transitive; with transi- 
tive stems a second participant is added. 
na- generally indicates a kind of causative 
situation in which the associate also par- 
ticipates, taking a subordinate part, in the 
action: ininkkpa he takes it < nak to go, 
ini?it” he has it < it¥ to be, exist, inihédkspa 
he causes the other to hoe it along with him < 
hoks to hoe something, iniydSp4?tpa he 
causes the other to help him with himself < 
yo'Spa?t to help someone. 

Changes in the arrangement of outer 
affixes are possible. There are a limited 
number of SP verb roots that are not stems. 
Some of these occur with inner affixes (see 
4.3); a few occur with outer affixes to form 
stems. Usually stems formed in this way 
function as inner formations and may occur 
as members of compounds, or even (rarely) 
with inner affixes: inimétpa he mizes it 
together < mot underlying root, magé?ypa 
he lies < mak underlying root, ha?d&?ypa 
it ts carried downstream < ha?t underlying 
root, tA:né?*ba he stands quietly < ta’ 
underlying root. 

There are a few roots that are also stems 
but whose meanings cannot be predicted 
when occurring with outer affixes: aktAnpa 
it falls < tay to chop something, ipakka?-ba 
he knocks him down < pak to bump some- 
thing, ipacd?ypa he throws it away < pac 
to throw something, mamné he is ill << mam 
to lay something down horizontally. 

A few stems have a rearrangement of the 
outer affixes which occur in the stem: 
nanikne?6?y he owes [money], ik6?npakne?- 
4?y he carried it along on his shoulder for 
him. 

In the statement of the outer affixes 
given above successive IC cuts would first 
cut off -o?y then ak- at a later stage: 
akka?6?ypa he will kill < akka to kill 
something. However, in some forms ak- is 
removed by the first IC cut, even though 
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-o?y is present in the stem: ikmaéto?poy 
téhka he caused the deaf person to be able to 
hear < mato?no’y to be hearing, iktihé?ypa 
ttih he caused it to rain < thoy to be 
beating. 

4.2. Any stem which contains two or 
more verb roots, or one or more verb roots 
and one or more substantive or particle 
stems is a compound verb stem. SP verb 
compounds fall into two basic types: verb- 
element compounds and compounds which 
contain a substantive or particle. 

Three types of verb-element compounds 
are recognized: special compounds, regular 
compounds, and enclosed compounds. 

There is a relatively small group of roots 
and stems larger than roots that appear to 
have special compounding possibilities, in 
that they may occur with regular compounds 
and enclosed compounds, as well as with 
roots and inner formations. Stems in which 
one of these special compounding forms is 
an IC are special compounds. Two special 
compounding forms occur preposed and 
the rest occur postposed to other stems. 
Many of these special compounding forms 
have a slightly different meaning when so 
occurring than they do when they occur as 
the nuclear element of a stem. At least one, 
kuru't, is not itself a stem. Three others, 
poy, cak, and nas, have been observed to 
occur with outer formations. 

The special compounding verb root poy 
(basic meaning: to run, flee) occurs preposed 
to a number of different stems. In this 
position poy means a litile while, quickly: 
poyhéhpa he rests a little while < heh to 
rest, pant, ip6?yanhd:ppakcdékpa he closes 
it by rolling something in front and leaves it 
for a little while < anho-ppakcak to close 
something by rolling something over it and 
leave it (in this example cak is also a special 
compounding form, poy has a somewhat 
wider distribution than the other forms of 
this class). 

The second preposed special compounding 
form is apnhe?-ga to hurry: aphé?-gatikpa 
I’m going to hurry and pick my corn < tik 
to pick corn. 
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A few examples of the special compound.| 
ing forms that occur postposed are given 
below. Others will be found in the full 
Spanish version. 





nas to do first, do just before (basic mean- 
ing: to pass by): awi?kndspa I’ll eat first <| 
wi’k to eat, inwéhpa?tndspa he prays t| 
him beforehand < aywehpa?t to pray to| 
someone. ket to finish (basic meaning: 
go down): ihokskétpa he finishes hoeing it <| 
hoks to hoe something. moy to do in the 
evening (basic meaning: to sleep): wi?kméyppa 
he is eating supper < wi?k to eat, insdshak- 
méppa he cooks a lot of it in the evening </ 
ansoshak to cook a lot of something. kurut| 
to turn out partly good and partly bad: 
iwatkurtit he made it but it did not turn 
out well < wat to make, do something,| 
iSéskur't he cooked them but only part of 
them got done < sos to cook something. 
anhak to do for a long time (basic meaning: 
to lead, rule over someone): tA-ne?anhdk 
he stood quietly for a long time < ta-ne to 
be standing quietly, mo?nanhak he over! 
slept < mop to sleep. 

There are three classes of inner forma- 
tions which form regular compounds; thes 
are based on their occurrence in two pos: 
tions within the compound stem: thos 
occurring only in position I, those occurring 
only in position II, and those occurring in 
either position. Generally speaking re. 
duplicated forms occur in position I. The) 
action of the stem in position I either pre- | 
cedes in time the action of the stem in| 
position II or is somehow logically prior] 
to it. 

If both stems in the compound are in-| 
transitive the resulting stem will be in-| 
transitive; if one of the stems in the com- 
pound is transitive the resulting stem will 
usually be transitive. However, compounds 
containing a?m to look at something and an 
intransitive root are intransitive: a?nki?mpa 
he looks up < ki?m to go up, a?mnikpa he 
looks into the distance < nak to go. 

Regular compounds of root plus root: 
it¥anmacpa it pecks something <_ taj 
to chop something and < mac to grab some 
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thing, ihaypa’tpa he addresses him < hay 
to speak and < pat to find something, 
j?ispAkpa he ts acquainted with him < if 
to see something and < pak to take some- 
thing, ihé?-yi8pa he hates him < ho?y to be 
angry and < i8 to see something. 

Regular compounds of inner formations 
(larger than root) plus root: inwehpa?tpa 
he prays to him < anweh to shout and < 
pa’t to find something, itdganhé?k he 
frightened him < cak to touch something 
and < aphe?k to be startled, it*i?capdkpa 
he follows him < tu*ea: (cf. tu? tazl) and < 
pak to move horizontally against something, 
to bump something. 

Regular compounds of inner formation 
plus inner formation or outer formation: 
wone’aktin he lay down < worne to be 
lying down and < aktan to fall. 

A few compounds contain compounds as 
one of the constituent parts: it’a*mwA-cdkpa 
he stretched it into place < tam to stretch 
something and < war‘cak to fix something 
(from wa‘ good and cak to leave something). 

There are roots which occur in position 
II and form an IC with a prefix which is 
preposed to the root in position I. The two 
together (prefix and root in position II) 
then constitute one IC and the root in 
position I the other. These _prefix-root 
combinations are ENCLOSING FORMS, and 
the stems that are formed are ENCLOSED 
comMpouNDs. The prefixes used are the inner 
prefixes an- and ku- (see 4.3) and the outer 
prefix ak- (see 4.1.). A few examples are 
presented below; others will be found in 
the Spanish version. 

ku-..-a?y to do completely (ku?a?y) to 
become clean, clear): kunAgayydéh they all 
went away < nak to go, kutiks4é?ypa he 
itches all over < tiks to itch. an-..-a?y to 
open a space by ... (a?na?y to open some- 
thing): inpi?pa?ypa he clears a space by 
picking up some < pin to pick up something, 
incecd?y he chips off a small piece from the 
tree < cee to plane, shave wood. ay-..-nu?k 
fo close by ... (aynu?k to close, close some- 
thing): inta?nni?kpa he closes it by making 
a fence < ta?n to fence. an-..-hak to do a 
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great deal of (anhak to lead, rule something): 
insoshikpa he cooks a great deal so there will 
be enough < sos to cook something, inhi: 
shaékpa he imagines it < ha‘s to think of 
something. ku-..-ket (kuket to go down to 
someone): ikuyo?ykét he jumped down on 
it < yo?y to jump, kuchgoykétpa there is 
shaking down everywhere < cago?y to move.2° 

Nonverb-verb compounds are of several 
types. A very productive pattern is that in 
which the syntactic participant of a transi- 
tive verb is incorporated into the verb 
stem itself; the new stem thus formed is 
intransitive. Such a syntactic participant 
must, in every case, be a noun: ké-mtanpa 
he cuts a cornfield < tay to chop something 
and < kaé-ma cornfield, mokmé?cpa_ he 
searches for corn < me?c to search for some- 
thing and < mok corn, sakiiippa he plants 
beans < iip to plant, plant something 
and < sak beans, kawahakni?kpa he waters 
the horse < akna?k to cause to drink and < 
kd4wah horse, tigakté-fipa he builds a house 
< aktefi to cause to stand and < tak 
house. 

Other nonverb-verb compound patterns 
are much less productive. They include 
(a) noun-verb combinations: ahi-sit¥ogéy 
I got confused < togoy to be lost and < 
hA‘Si thoughts, ké:sté-ipa he kneels < tei 
to stand up and < ké:so knee, iké-bakpihpa 
he annoys him < pih to be hot and < k6é:bak 
head, ima'sanwatpa he blesses him < wat 
to do, make something and < mé&:san Span- 
tard, white man; (b) ‘adjective-like’ nouns 
with verbs: iwA-cdkpa he fixes it < cak to 
leave something and < wa: good, iyAkkAnpa 
he paints it black < kay to paint something 
and < yak black; (c) two special compound- 
ing substantives: téget responsibility, fault, 
and ka hand: tégetnXkpa he must go < 
nak to go, tégetwit’pa he limps < wit’ 
to walk, ikA?pinpa he tracks it < pip 
to pick up something, kA?-pit he escaped < 
put to go out, ikA?-hiiypa he hires workers < 
huy to buy something; (d) certain particles: 

20 This is the only bimorphemic enclosed stem 


(cak to touch something, -o?y indefinitizer) encoun- 
tered. 
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i¢4-mh6é?-yiSpa he hates him very much 
< ho?-yi8 to hate someone and < ca‘m very, 
amédvohi‘sit¥ogéy I got partly confused < 
ha‘sit¥ogoy to get confused and < med¥o 
half. 

4.3. In addition to the roots themselves 
there are three types of inner formations: 
stems with inner affixes, verbs derived from 
nonverbs, and uniquely derived stems. 

There are two inner prefixes, ku- and 
an-, which differ from outer affixes in that 
they rarely form an IC with more than a 
single root. These prefixes, ku- and ap-, 
are mutually exclusive, and ay- is also mutu- 
ally exclusive with ak- and occurs rarely 
with na-. 

The core of meaning of the prefix ku- 
seems to be otherness. This basic notion 
manifests itself in two ways: a distributive- 
repetitive connotation suggesting other 
occurrences of the same action in time or 
space, and a ‘separation’ from the normal 
routine of action the root would suggest 
with a ‘specialization’ which may involve 
another person. With either of these mean- 
ings ku- is still mildly productive. 

ku- occurs (a) with some roots that are 
not stems: kunu'm to stalk something, ku- 
musuy to swim under water, kucat to send 
someone; (b) with stems which denote the 
action has broad spatial dimensions; most 
of these are verbs derived from nouns by 
the affix -a°: kupi?*¢a° to get dark < pi?-éa 
darkness, kupa?ma: to be covered with weeds 
< pam weeds, kuhehe?a to be surrounded 
with mesquitos < hehe mosquitos, kukeh 
to dawn < keh to appear, be visible, kutogoy 
to be covered (as by tall grass) < togoy to be 
lost, kumoyn to sink < mon to sleep; (c) to 
denote that the action is repeated, often in 
different places: ku?a?m to look for some- 
thing < a?m to look at something, kute-ii to 
stop repeatedly < tefi to stand up, kuhop 
to roll about < hop to roll, kupin to sort 
something < pin to pick up something, 
kutan to test by chopping something < tan 
to chop something; (d) to denote a specializa- 
tion of action related to another person: 
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kuhay to speak evil of someone < hay t 
speak, kutagay to defend someone < tagay 
to enter, kunu?k to go up to someone < nu 
to arrive, kuhoks to hoe at another’s place < 
hoks to hoe something, kupak to belie 
obey someone < pak to take something. 

The core of meaning of the prefix ap. 
seems to be togetherness. Derived from this 
basic idea are such notions as ‘concentration 
of action to a point’, ‘motion toward 
‘closing’, that are present in most of th 
stems in which an- occurs. Ia general, ap. 
appears to be somewhat less producti 
than ku-. 

al)- occurs (a) with some roots that ar 
not stems: ansah to open one’s mouth, 
anmat to speak, visit, a?na: to fight, argu, 
anya?k to be sad; (b) to denote the action 
of ‘closing’: anmifi to heal < mifi to com, 
annu?k to close, close something < nuk ti 
arrive, aypah to close a door or gate < paht 
pen up something; (c) to indicate motion 
objects toward each other or of one objec 
toward another in a fixed position: a?na?mt 
wait for someone < a?m to look at somethin, 
ante i to le in wait for someone < te’ii ti 
stand up, annak to chip metal < nak to gu 
(d) to indicate concentration of action toi 
point: anhay to sound, bang < hay to speak, 
antan to chip off a piece of bark < tay t 
chop something, aymeh to sharpen somethin} 
< meh to rub something; (e) with an inten 
sive, often purposive, meaning: anwé 
to shout < weh to weep, ankeh to exhibi 
something < keh to appear, be visible 
anhak to lead, rule someone < hak to crosi 
cross something, ayntagay to work hard < 
tagay to enter. 

Inner suffixes include a directional suffi 
and the reduplicative suffix. 

-A?yA?y/-a?y*? direction along a_ lin 
occurs with a few roots. When this suffi 
is present no other derivational affixes have 
been observed to occur: yA?m ptida *yA% 
here he went out < put to go out, yAm' 


21-,?ya?y occurs with monosyllabic roots, 
-a?y with dissyllabic roots. 
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ciku?’mA?y here he started out this way < 
cukum to begin, start. 

Almost every SP root and a few larger 
forms may occur with the reduplicative 
suffix which consists of a repetition of the 
phonemic shape except that, in single 
yllable roots, - (vowel length), ? and stress 
do not occur in the reduplicated form. The 
reduplicative morpheme means_ repeated 
action. 

The sequence of root plus reduplicative 
morpheme is never immediately inflected; 
it must have with it another stem or an 
outer affix (-o?y or -ne): {8i86?ypa he is able 
fo see < iS to see something, nAmnamné?ba 
he keeps saying it over and over < nam 
to say, yAksayAksané?-ba he goes like this < 
yaksa‘ to do like this (< yaks like this), 
iPAntanhdkpa he cuts it all apart by chop- 
ping < tan to chop something and < hak 
fo cross, cross something, ikuhA-thad4?ypa 
he marks it all up < ha:t to mark something 
and < ku-..-a?y to do completely. 

Certain roots occur only with the re- 
duplicative morpheme: mitrumtiruné?-ba 
he is visiting, t¥ékSekt¥ékSekné?-ba he 
wiggles his lower jaw, pdyt¥akpdyt*agé *ypa 
he runs around in a large circle (cf. poy 
to run). 

Two suffixes -a° factive and -a?y possessive 
occur with nonverb stems to form verb 
stems. However, three nonverb stems occur 
with outer affixes: ce?m6?y he urinated < 
cem urine, t¥i°id?y he defecated < t*ii 
feces, siksukk4?-ba he is shivering < suksuk 
cold. 

-a' factive occurs with a number of non- 
verbs to form verb stems. The resulting 
stem is always intransitive: maha:p he 
gets big < mah big, taga-p he builds a house 
< tak house, moga-p it produces corn < 
mok corn, hdé?gd:p it smokes < ho:ko 
smoke, wad¥a?yd'p he gets old < wad¥a-ya 
old man. There are a number of forms that 
appear to have verbal meaning and do not 
occur as substantives, but which must 
occur with -a°: yé:SA:p he is working, k6?-- 
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wap he is injured, kupé6?-d’4:p he gets a 
haircut. 

-A?y possessive occurs with a large number 
of nonverbs (mostly nouns) and the resulting 
stems are intransitive, though some of them 
may also be transitive. The intransitive 
stems are usually not inflected by other 
than the person markers and number suf- 
fixes. The intransitive stems indicate that 
the person marked possesses the item indi- 
cated by the nonverb stem: tag ?y he has a 
house < tak house, ku®yA?y he has wood 
and iku?ya?ypa he puts wood on it < kuy 
wood, tu?nA*y zt has a road and it’u nA ?ypa 
he follows him < tun road, k&?:mi ?y he has 
a cornfield < k&:ma cornfield, ahémagi ?ypa 
I’m going over there < hémak over there. 

There are a number of unique derivations, 
part of which can be identified as a mor- 
pheme, but the other part of which does not 
occur elsewhere. These forms usually are 
two syllables and different vowels occur in 
each syllable (see also two-syllable roots 
below). A partial list of these forms follows: 
mo'was to yawn, cf. was to bite something; 
ciput to turn inside out, ef. put to go out; 
honam to do something in a certain way 
[tanhonkmpa how shall we do it ?], ef. nam 
to say; sapit’ to hem something, cf. pit¥ 
to make a cage-like framework with string. 
Some two-syllable roots may be similarly 
related to single syllable roots: anymatama- 
tane to be visiting, cf. anmat to speak; 
wokowokone to rattle, cf. wok to bark; 
hakahaka?o?y to follow along, cf. ha‘k to 
pull something; kuwaha?a?y to dissolve, 
ef. wah to disappear. These are not merely 
reduplicated alternants since each of the 
short forms may be reduplicated as well: 
wokwokne to bark repeatedly. 

All observed verb roots contain either 
one or two syllables. Single syllable roots 
have the following shapes: CV°: pa: to 
get fat, pa: to have diarrhea; CV: pa to strain 
something, no to burn something; CVC: pet 
to sweep, sweep something, nak to go; CV-C: 
se't to return, hak to permit something, 
ka?m to stay placed; CV?-C: hi?-S to rub 
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something with the finger, me?s to wall up 
something; CVCC (final C is always /s/): 
poks to double down corn, heps to dish out 
something; CV ?CC: so?ps to get tired, ne?ks 
to tie grass on a house roof. In the above pat- 
terns the first C means a consonant or 
nothing: CVC: uk to drink. 

Two-syllable roots all contain the same 
vowel in both syllables: CVCV: Sit¥i to 
shiver, ala to move back and forth; CVCVC: 
cukum to start, togoy to be lost, tagay to 
enter, hukun to stir something; CVCCVC: 
sehken to become loose, t¥kSek to move the 
lower jaw; other patterns: wo?nko to roll 
about on the ground, potoks to bend a finger 
joint, tanga to be upside down. 


5. Substantives are considered to be a 
single class because they exhibit certain dis- 
tributional characteristics in common: (1) 
they all participate to some extent in the 
substantive-inflection system, (2) verbs are 
derived from all members of the substan- 
tive class in the same way, (3) participles 
are derived from all members of the sub- 
stantive class in the same way. Substan- 
tives are divided into major subclasses 
chiefly by their occurrence with the person 
markers. 

5.1. The inflectional system occurring 
with nouns consists of the person markers 
(see 2) and four orders of inflectional suf- 
fixes. Nouns are differentiated from other 
classes of substantives in that they (nouns) 
occur optionally with Set IIT person markers. 
Other classes either do not occur with Set 
II person markers, or a member of Set II 
is obligatory. 

There follows a chart of noun inflection. 
All classes of substantives participate in 
this system in one way or another. 





Per. Order 
markers Order I Order II Order III IV2 
Set I 

a- -ma 

ta- 

mi- -anhoh -ho-m 

22 Cf. fn. 4, 
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Set II 
-anho-m -tam 
an- Stem -ka?-m 
tan 
ifi- -yukma -yah 
i- -kuka?m: 
Set III 
-tam -anku?k 
man- -gak —-tij 
ane- 


The meanings of these forms will be given 
as each group is discussed. The occurrence 
of person markers with nouns has already 
been discused in 2. 

Some arbitrary limitations of distribution 
have been observed: -anhoh has been ob- 
served to be followed only by -ho-m, but 
by no other suffix; when -tam occurs in 
Order I only -aphoh and -ka?‘m have been 
observed following it; -gak occurs either 
immediately after the noun stem or pre. 
ceded by -ka?'m, -gak is usually followed 
by -t¥i, in noun inflection -t¥i has not been 
observed without -gak. 

The various orders of suffixes are discussed 
in a more or less random sequence since 
discussion in terms of general categories is 
more helpful than one in terms of orders. 

SP nouns are inflected with two number 
suffixes, -tam and -yah, which have counter- 
parts in verb inflection, and with the suffx 
-anhoh. 

-yah plural (Order III) occurs with most 
nouns; however, the lack of this suffix does 
not necessarily mean singular. It is possible 
to speak of more than one without using 
any number suffix. Plural affixes are optional. 
tXkyah houses, kiyyah trees, ké-fikuyyah 
chairs, yé-yayah pigs, ch‘Siyah children. 

-anhoh mass plural (Order I) occurs witha 
number of nouns: tagdphoh many house 
together, a village, t6?;danhoh much paper in 
a pile, cadnhoh many rocks, a rocky place. 

-tam plural (Order I) occurs with a fev 
nouns referring to human beings: yé:mtam 
women, cA-St¥am children, it¥£-watam his 
brothers, w6iit¥’am Little girls. This subclass 
may also be pluralized by -yah: wo-itiyah 
little girls. A different subclass, containing 4 
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limited number of stems, whose inflection 
is otherwise identical with that of nouns, 
is pluralized with -tam. This subclass con- 
tains ‘adjective-like’ words: wAtam good 
ones, cAbactam red ones, yikmatam high 
ones. 

-yah and -tam are also used to pluralize 
the person markers (see also 3.1). When 
Set II is present -tam occurs with first 
and second person markers and -yah occurs 
with the third person marker. When used 
in this way -tam occurs in Order III:% 
antAktam our (excl.) house, anké:mhé‘mtam 
in our (excl.) cornfield, tankiwahtam our 
(incl.) horse, iyo‘yayah their pig, his pigs, 
their pigs, -yah occurs with Set II person 
markers but pluralizes the noun rather than 
the person: antAkyah my houses. 

-tam also occurs with Sets I and III when 
these occur with a noun stem: tapA‘sifit¥am 
we all are men, miyd'yatam you are all pigs, 
mancA‘st¥am you are my children. -yah 
occurs optionally in some of these situations: 
(he?) pA-Sifiyah they are men, miyé:yayah 
you are pigs. 

There are seven locative suffices in SP 
(Order II). Nouns inflected with these affixes 
do not function syntactically as subject or 
object, nor do they occur with Sets I and 
III person markers. 

-ma instrumental: kiyma with a stick, 
wood, k&:pma with an arrow, petkiyma 
with a broom, nXk pd-yirma he went running. 
With a few nouns -ma has locative function: 
itvAkma his country cf. tak house, it*® 
ié?-sma he is in bed cf. cé?:s bed. 

-ho'm locative, in: takhé'm in the house, 
ki:mhé-m in the cornfield, t6é*thé'm in the 
book. -ho‘m also occurs with some nouns 
which, when occurring with verbs of motion 
(go, come, etc.), could be glossed to get... : 


*3 It should be pointed out that -tam and -yah 
are observed following members of Order II only 
when functioning in this way. It would, therefore, 
be possible to include -yah as a member of Order I 
and say that Order I pluralizes the noun and Order 
Ill pluralizes the person markers, except that no 
member of Order II has yet been observed follow- 
ing -yah. 
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ndkne mokhé'm he has gone to get corn 
ef. mok corn, é6yyah sakhé-m they have been 
to get beans cf. sak beans. 

-aphé-'m locative, among: yé-ya?anhé'‘m 
among the pigs, itYA-wata?manhé'm among 
his brothers. 

-ka?*m locative, at: nXi?*kk?*m at the 
water, ik&*mkA?*myah at their cornfield, 
takki ?-m at the house. 

-yukma locative, up, upon: mésahyikma 
upon the table, takytikma on the house. 

-kuka ?*m locative, under: ké-fikuykuki ?*m 
under the chair, mé:sahkuki?*m under the 
table. 

-anku?k locative, in the midst: pA-Si?fiay- 
ka?k in the midst of the people. 

Suffixes -gak also, again and -t¥i just 
very often occur together with noun stems. 
Words containing both are most frequent 
in the following kind of dialogue: tf 
sh?ba inwat what are you doing? thkgaktyi 
sX?ba anwdt I’m building a house. -gak 
may occur without -t¥i: ani?kta nA?*kA? 
ngak we arrived at the next water. 

5.2. Pronouns are differentiated from 
nouns in that they occur with Set I person 
markers but not with Sets II and III. 
Pronouns are also inflected for number, 
occur with a few locative suffixes, and with 
suffixes -gak and -t*i. Five pronouns are 
distinguished: aé J, me, mié you, he/he?m*™4 
he, him, she, her, it, pe®m that one, i: who, 
whom. Set I person markers occur with 
pronouns as intensives (when the marker 
for the same person is added), and to form 
stative constructions (when the marker 
for a different person is added): a?Aé J, 
ta?Aé you and I, mimié you; ahé I am he, 
mi?i: who are you? 

Number suffixes occur with pronouns: 
Aét¥am we, us, mitt¥am you (pl.), {t¥am 
who (pl.), hé?-yah they, pé?myah those. 

Three locatives have been observed with 
pronouns: acékA?*m at my place, mimiét¥a ?- 


24Pronouns may modify nouns in syntactic 
constructions. Therefore, he?m (usually trans- 
lated the) is considered to be an alternant of he in 
attributive position. 
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maphé'm among you, pe*mytikma upon 
that one. 

-gak and -t¥i occur with pronouns: 
aAégakt*i I also, hé?-gakt*i he also, i-gagam 
who else? 

5.3. There is a group of words which are 
distinguished from other substantives in 
that they always occur with one member of 
Set II person markers; we call these forms 
nominals. Nominals do not occur system- 
atically with any other substantive in- 
flection. Some nominals occur with loca- 
tives, number suffixes, and -gak and -t’i 
while others do not. Of the various sub- 
classes of substantives, nominals exhibit 
the least unity of syntactic function: it*d 
mpay all of them, tantti‘mpay all of us, 
ikuyikma the reason for it, ikushywii 
above it, ikukA?*m beneath it, ikundswiii 
down below it, anhdé-dop my knowing, 
ihé?-ya his anger. 

5.4. Adjectives, like pronouns, occur 
only with Set I person markers and with 
number suffixes, but unlike pronouns, they 
do not occur with locatives: ahé?yantam 
there are many of us, Sit’uyah they are small, 
asinkoh we are five. 


6. As with verbs the difference between 
derivational and inflectional elements of 
substantives is not completely clear. The 
following criteria are used to distinguish 
them: (a) Any morpheme which occurs 
with a verb stem to form a substantive is 
derivational; (b) any morpheme which 
occurs as a prefix to the second stem or as a 
suffix to the first stem of a compound is 
derivational in all occurrences; (c) any 
affix morpheme which radically changes the 
meaning of the stem is derivational. 

6.1. Derivational affixes occur with roots 
and stems of various classes to form sub- 
stantive stems. 

There follow illustrations of affixes occur- 
ring with substantive roots and stems to 
form new stems: 

-tu'ku old, valueless occurs with a number 
of nouns: taktiku old, dilapidated house, 
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pukti-ku cloth ef. pirki cotton, yuktirku 
orphan cf. ha?yuk younger sibling, pétkuy- 
ti‘ku old broom. 

-tYa‘ka without, -less occurs with a few 
nouns, generally body parts: kA?-t¥4‘ka arm- 
less, puyt*4'ka without a foot, tu ?ct¥a‘ka with- 
out a tail. 

-te'ro™ doer, performer (see below for oc- 
currence also with verb stems): yé-mté-ro 
ladies’ man, yé-yaté'ro hog dealer, kiwahté-ro 
good horseman. 

-ko'm place of occurs with a few substan- 
tives: wA'ké-m good place, tA‘8k6-m Ocoso- 
tepec, ¢inké-m place of pine trees. 

-pak arroyo*® (cf. pakus arroyo): kA-ypak 
Tilapa River (lit. jaguar arroyo), éSpak So- 
chapan River (lit. crawfish arroyo), nk?anné: 
ypak an unnamed (on maps) tributary to the 
Guasuntlan River (lit. water-flower arroyo). 

There follow illustrations of affixes oceur- 
ring with verb roots and stems to form 
substantive stems: 

-kuy instrumental occurs with a large| 
number of verb stems to form nouns) 
pétkuy broom < pet to sweep, sweep some- 
thing, haykuy pencil, pen < hay to write, 
iSkuy eye < iS to see something, sé?tsedé?- 
ykuy frying pan < se?tsedo*y to be frying. 
-kuy also occurs, in fewer forms than above, 
with meaning intended object of action: 
yom ?ykuy one who should marry < 
yo?'ma ?y to have a wife, take a wife, k6?ckuy 





one who should be hit < koe to hit some-| 
thing. 

-i result of action occurs with a large 
number of verb stems to form nouns:} 
pihi heat < pih to heat, get hot, ha*pi 
corn meal dough < ha?p to grind something, 
wki a drunk < uk to drink, drink something, 
AkSi shelled corn < aks to shell corn, hiki 
cigar, cigarette < huk to smoke. 

-A/-a result of action occurs with a few! 
stems instead of -i or, in a few cases, iD 
addition to -i: pu?ka tropical ulcer <| 


' 
; 


25 This suffix has an atypical phonemic shape. It 
may be there has been Spanish influence. | 
26 See also Wonderly, William L. Zoque Place} 
Names, IJAL 12.217-228 (1946), esp. p. 223. 
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pu’k te rot, mé-ya flower < moy to foam, 
flower, kuy84?*ma splinter < sa?m to splin- 
ter off. 

-ka?-wif plural actors occurs with a few 
stems, usually intransitive ones: kuyt: 
hkA?wih learners, pupils < kuyuh to learn, 
yOSakA wifi workers yorSa: to work. 

-al) occurs with one verb stem to form a 
noun: hfpsay live coal < hips to burn. 

-te'ro performer of action (see above for 
occurrence with substantive stems): hayt’- 
éro talker, one who talks a lot < hay to 
talk, yO"Saté-ro worker, one who works hard < 
yosa’ to work, mi-ét¥é-ro player, one who 
likes to play < ma‘é to play, ecté:ro dancer 
< ec to dance. 


-pAy measure occurs with a few verbs to . 


form substantives téfipay the height of a 
man < tefi to stand up, ancénpay the 
amount in a sack to where the mouth can be 
tied < ancen to tie something closed. -pay 
also occurs with substantives: it’i:mpay 
all of wz < tum one, yakspay like this, 
about this much < yaks like this. 

-kay occurs with a few forms to derive 
adjectives: hikskay quickly, rapid < hiks 
to hurry, tXhkay fast, Séhkay light weight, 
wéskay fragile. 

There are several derivational prefixes 
which occur with substantives (and nu- 
merals) to form substantive stems: 

ku- spread, extent (see also 4.3) occurs 
with some roots that do not otherwise 
occur: kuté-t¥i brains, ku?A-Si foolish, 
kukivma ring, ikush:ka his nakedness, 
kup6?k runt. ku- also occurs with other 
substantive stems and roots: kut*ifi lazy < 
tii excreta, ikuwifi his form < wifi width, 
ikuyikma the reason for it < yikma up, 
height, ikusXywifi up above it < sAnwiii up- 
ward. ku- also occurs with numerals and ad- 
jectives form nominals: ikuttku the three 
of them, ikuwAstak both of them, ikut’u'm 
by himself, ikusinkoh the five of them. 

ay)- point (see also 4.3) occurs with some 
roots which do not otherwise occur: ankéhke 
chin, anwiSi beard. an- also occurs with 
other substantives: antAc tooth of a comb < 
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tac tooth, anki-ii point of a knife, stick, etc. < 
kifii nose, a’ndy page of a book < ay leaf, 
ana fi pimple < afi tortilla. 

na- similar (see also 4.1) occurs with a 
few forms: inimi‘Sit¥A-wa his fellows Oaxa- 
queno < mfSi Oaxaquefio and < th-wa 
brother, fellow, iniyé‘mtA-wa her sister with 
her < yé-mtA-ma sister. 

ak- (see 4.1) occurs with one noun: 
akea testicles < ca rock. 

waga- (see 3.1) occurs with a few forms: 
wagayo'mo twin girls < yémo woman, 
wagapA‘sifi twin boys < pA-Sifi man. 

6.2. Two types of compound substan- 
tives are recognized, regular compounds 
and enclosed compounds. 

Examples of regular compounds are: 
nu?’nktypak needles < nti?nkuy sewing 
instrument and < pak bone, isktiiypak eye- 
brow, eyelash < iSkuy eye and < pak body 
hair, feathers, sdhtam acorns < soh oak 
tree and < tam fruit, mA?ta:tuy father-in- 
law < mi? son-in-law and < ha-tuy 
father, kiyham ashes < kuy wood and < 
ham lime, kobéktak hat < k6é:bak head and 
< tak house, curé-S{ tarantula < cu night 
and < é8i crawfish, kuyké-bak tree stump 
< kuy tree, wood and < ké:bak head. There 
are three special compounding fo. rms: nunta- 
real, very, naps- pure, and mu ma- whole, 
complete. Examples are: ntintawA- very goed, 
nunténa real water, just water, nintahs-yi 
Popoluca language; ndpsna pure water, 
napshéckuy just combs; mi*maha-ma all day, 
mumacti all night. 

There are remnants of enclosed com- 
pounds: kuéi?’paénna water that has been 
bathed in < &in to bathe (cf. na water), 
kuce Anna water that has been washed in < 
ce to wash something. There is an enclosing 
form meaning season of: kuhd-msayn dry 
season < hd-ma sun (cf. san sky), kusagaéyjka 
bean season < sak beans (cf. Anka yard), 
kuktiyt’a ?maynka avocado season < kiyt’am 
avocado, kumogd4nka corn season < mok 
corn, kuttihka rainy season < tuh rain 
(here only ka is present, this is perhaps an 
alternant of 4nka.). 
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There is one enclosing form that appears 
to consist of a prefix-suffix combination: 
ikuhagan a piece of it < hak to break cross- 
wise, go across, ikuck?yan something left 
over of 1t < ca?y to remain. 

6.3. There are a few forms which recur 
but to which no meaning can be assigned: 
-Gi: tAkéi cockroach (cf. tak house). kiiti 
nasal catarrh (ef. ki-fii nose), pukuinéi chigger, 
né?fiti rooster’s comb, ju?piti rainbow; -t*i/ 
-fii??: pdreét*i curly, ké?ét¥%i blind, tutti 
humming bird, si:miii banana, sAyii day- 
light (cf. san sky); -tuk: anwifit’uk on the 
other side (cf. wii width), pi-iitYuk deserted 
(ef. piri to fade into the distance). 

There are a few unique forms in what 
appear to be compounds: cii?’mak last night 
(cf. cu night), hA-pak corncob (cf. pak bone), 
nA?*pifi blood (cf. na water, liquid; also cf. 
Proto-zoquean *pan man), cA:nhon wild 
turkey (cf. hon bird). 

6.4. There are a few substantives derived 
from verbs by shift of stress: kApun barrow 
(cf. ikaptinpa he castrates, emasculates it), 
iwatnas his image (cf. iwatnaspa he makes it 
first), aphdkket slope (no verb counterpart 
has been observed, but the constituent parts 
can be identified as verbs). 

6.5. SP contains a large number of sub- 
stantives formed by reduplication of a 
root: pinpun slick, cAncan the smell of oil or 
metal, maiymay happy, ki*ikii hairy, fuzzy, 
tittit mestizo, si?:sa stem; hayahdya fast, 
quick, nituntitu bumpy, rough skin, &£-Sti-Si 
stink bug; iSpagi8pak friend, tA *nguyt*A n- 
guy¢ancan the smell of ink. 

6.6. Substantive roots occur in a number 
of different phonemic shapes: CVC: tuh 
rain, mok corn, hon bird; CV?C: mu?k 
grass, cure corpse; CV°C: pork corn stalk, 
cai snake; CV?-C: ko?y basket, sa?-p now; 
CV°V: mi?a deer, n&?a chewing gum; 
CVCV: kit*u harelip, t¥4ka chick, Situ 
small. A very large number of substantive 
roots have the shape CV‘CV. Normally, the 
second vowel is the same as the first: yé6-mo 


27 -fi occurs after nasals, -t¥i occurs elsewhere. 
28 The stem is of Spanish origin. 


OF AMERICAN LINGUISTICS VOL. XxXVi 
woman, hé-he mosquito, hi:ha house fly 
té'to paper, k&é-ma cornfield, kiii now, 
ké-ya rabbit, yé-ya pig. Other shapes ar 


possible: éhyah sister-in-law, 4>-wi Jishhook, 


kA ak huarache, 4:tébet Soteapan. 


7. Numerals are a very small class con 


sisting only of the numbers two through 
five, the question word how many, and on 
or two other forms, yet they have a char 
acteristic inflection which is different from 
all other inflectional systems in SP. Nu 
merals occur with Set I person marker; 
with number suffixes (substantive inflection 
-tam and -yah), and with -gak and -t% 
Numerals are distinguished from othe 
classes by the inflectional affixes -te’n an( 
-na*: wasté-n two, tuikuté-n three, maktasté1 
four, mosté-n five,® hi-té-n how many? The 
difference between -tevn and -na° is not 
clear. The former is usually used in count: 
ing and when the numeral is subject oa 
object; the latter usually occurs when the 
numeral is in attributive position, though 
both forms have been observed in all pos: 
tions. Forms containing -na’ are frequently 
accompanied by a number suffix: wasnayal 


there are two, atikuna‘tam there are three i 


us. Other illustrations: mihti-né:tam how 
many are there of you?, wasté-ngakt*i éy twi 
of them went. 


8. Members of any class of stems which 
participate in word inflection (see 1) may le 
used for participle derivation. Two method: 
of participle derivation are possible: on 
method is used to form participles from 
verbs, the other to form participles from 
nonverbs. Since a difference in syntactit 
function also parallels the resultant types 
participles, we distinguish V-participles ani 
N-participles. 

29 Numbers above five in Sierra Popoluca have 
not been observed, though there are reportedly 
old people who know them. (I could not seem to 
find any who could or would give them to me) 
Actually, numbers for four and five are seldom 


used in speech, Spanish numbers being preferred 
instead. 
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8.1. There are two suffixes used to derive 
\-participles; these parallel the contrast 
between ‘completive’ and ‘incompletive’ 
which is found elsewhere in the SP inflection 


' systems: -pa?*p incompletive participlizer and 
| -wa?p completive participlizer. Syntactically 
lass con 


\-participles function as participant or asso- 
ciate or as a part of certain noun phrases; 
they are usuaily preceded by he?m the, that: 
he?m nakp4?*p ip&*tpa the one who goes 
will find it, afiyé-Sp4 *tum he?m aktanwA ?*p 
I helped the one who fell down. 

Inflectional prefixes and Orders I and II 
of inflectional suffixes may occur with the 
verb stem before the participlizing suffixes 
are added: he?m a?ffst¥anwi?-p pdy the one 
we saw ran, he?m k6?ctawA?'p the one some- 
one hit. 

Actually the participlizing affixes appear 
to form an IC with the entire clause: he?m 
nakwi?p a'tébet mathk togédy tunphé-‘m 
the one who went to Soteapan yesterday got 
lost on the road, he?m iko?cwi?'p it’A*wa 
ti?-t¥a kastigo the one who hit his brother was 
punished. 

8.2. A single suffix, -pak, is used to derive 
N-participles: cAbacpak red ones < cdbac 
red, yompak female ones < yé"mo woman, 
he?m wasté'npak the two < wasté'n two. 

Inflectional prefixes and suffixes may oc- 
eur with substantive stems before -pak is 
added: k4-mhé-mpak one from the cornfield, 
it¥akkA?*mpak the one from his house, 
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cdbactampak red ones, kécakyikmapak 
one from the mountain. 

When the negative particle d’a occurs 
with a verb and a participle is formed, -pak 
occurs with d¥a rather than -pa?*p or -wa?"p 
with the verb stem: he?m d’4pak nik the 
one who did not go. 


9. Certain stems which do not participate 
in any of the inflection systems other than 
word inflection, and therefore can be re- 
garded as particles, consist of more than one 
morpheme. 

Several stems contain an element -ak: 
hémak over there (cf. hé-m there), hésak then, 
wifiak long ago. 

There are two affixes which occur with 
numerals and some other forms to derive 
particles, -ma days hence and ma- days past: 
maktasma four days hence, ttikma three days 
hence, wastAkma day after tomorrow, héyma 
tomorrow, amtYAyma next year; matik 
yesterday, mawAski two days ago, matii-ki 
three days ago, ma?amt¥Ayma last year. 

Some forms in which the suffix -kay 
(see 6.1) occurs are particles: wAskay twice 
<was- two, wa't’kay many times < wa't¥i 
many. 

Other forms have constituent morphemes 
that are present in substantive derivation 
and inflection: anwifit’uk on the other side, 
ansh‘kma outside, anhdkket slope, ankA?*m 
outside (i.e. not in the house). 
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9. The nonsingular subjective prefix 
9.3. The pronominal constituent 


9.7. The number morpheme 

9.10. With the modal prefix 

9.12. Summary 

10. The objective prefix 

10.5. The singular objective prefix 

10.9. With the modal prefix 

10.11. The nonsingular objective prefix 

10.14. With the modal prefix 

10.16. Summary 

11. The transitive prefix 

11.2. Combinations of first and second 
persons 

11.4. Masculine subject and object 

11.7. Restricted third person subject and 
object 

11.11. Other combinations 

11.14. With the modal prefix 

11.16. Summary 

9.1. The term PRONOMINAL PREFIX will be 


used from this point on to include the singu- 
lar subjective prefix described in 6, which is 
the irreducible pronominal prefix, and the 
expansions of it to be described in 9-11. 
These include the nonsingular subjective 
prefix (9), the singular and nonsingular ob- 
jective prefix (10), and the transitive prefix 
(11). 

9.2. The singular subjective prefix de- 
scribed in 6 may be expanded by the suffixa- 
tion to it of a NUMBER MORPHEME to form a 
NONSINGULAR SUBJECTIVE PREFIX. The num- 
ber morphemes introduced in this section are 
the puAL (-ni-/-n-/-ny-) two and the PLURAL 
(-wa-/-wee-/-we-/-w-/-y-/-0-/-ti-/-n-) three 
or more. A third, the NONSINGULAR (two or 
more), will appear in 10 and 11. These mor- 
phemes are not in all cases simply added to 


* Seneca Morphology I and II, IJAL 26.11-22, 
123-129 


the pronominal morphemes of 6, but con. 
dition certain modifications of the pr. 
nominal constituent. The following diagram 
shows in a general way the structure of; 
verb or noun containing the nonsingular sub. 
jective prefix; the broken lines indicate that 
the modal prefix may or may not be present 











(8): 

aes : Verb or Noun 

: Nonsingular Subjective Verb 

! Modal Prefix ail 
| Prefix | Pronominal Number Stem 
i Constituent | Morpheme 








An example is haj|ti- masculine|plural in 
hati:hsa:s they’re looking for it, hati: waji:- 
yee? their family. 

9.3. Instead of being added directly to the 
first person morpheme the number mor 
pheme occurs with the EXCLUSIVE PERSOY 
construction, consisting of the first person 
morpheme (-k-) preceded by the exclusive 
person morpheme (?a-), which indicates that 
reference is made to someone in addition to 
the speaker, but excluding the hearer: ?akh- 
niyetha? we (excl. du.) are hitting it (withh 
between the k and n by 3.9), ?akwdyetha’ 
we (excl. pl.) are hitting it. 

9.4. There is also an INCLUSIVE PERSON 
morpheme (h-/t-), indicating that the hearer 
is included, which occurs alone (not in con- 
struction with the first person morpheme) 
before the number morpheme. The inclusive 
allomorph is h- before the dual morpheme, 
t- before the plural: hniyétha? we (incl. du.) 
are hitting it, twayétha? we (incl. pl.) are hit- 
ting it. 

9.5. The second person (s-/j-) and mascu- 

1 As noted in 8, the modal prefix only occurs it 


a verb, and then only with a punctual or impera- 
tive verb stem. 
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ne (h-/ha-/he-) morphemes occur directly 
before the number morpheme. The second 
person allomorph is s- everywhere except be- 
fore the plural allomorph -0-, where it is j- 
(98): sniyétha? you (du.) are hitting it, 
swayétha? you (pl.) are hitting it. The mascu- 
line allomorph is h- before the dual mor- 
pheme, ha- before the plural allomorph -ti-, 
and he- before the plural allomorph -n- (9.9): 
hniyétha? they (masc. du.) are hitting it,? 
hatiyetha? they (masc. pl.) are hitting it, heni?- 
tya? they (masc. pl.) dwell. 

9.6. Neither the feminine nor the neuter 
morpheme occurs before the number mor- 
pheme, but their meanings are combined in 
the NONMASCULINE morpheme (k-/wa-/we-), 
which does so occur. The nonmasculine al- 
lomorph is k- before the dual morpheme, wa- 
before the plural allomorph -ti-, and we- be- 
fore the plural allomorph -n-: khniyétha? 
they (nonmasc. du.) are hitting it, watyjetha? 
they (nonmasc. pl.) are hitting zt, weni?tyo? 
they (nonmasc. pl.) dwell. 

9.7. The dual allomorphs occur as follows: 
ni- occurs before a morpheme whose basic 
allomorph begins with a consonant: hniye: 
thoh let’s (incl. du.) plant! (-yethw-), sni: 
awi? you (du.) are carrying something (-’[h]- 
awi-); -n- occurs before a morpheme whose 
basic allomorph begins with a vowel other 
than a or e: ?ihne? we (incl. du.) are walking 
(e-), khnohe? they (nonmasc. du.) are alive 
(-ohe-) ; -ny- occurs before a morpheme whose 
basic allomorph begins with a or e: ?akya:toh 
we (excl. du.) say it (-ata-), hnyetd:s they 
(masc.) du.) are waving it (-etd-). 

9.8. The plural allomorphs can be divided 
into those which occur after the first person, 
inclusive person, and second person mor- 
phemes (-wa-/-wee-/-we-/-w-/-y-/-0-), and 
those which occur after the mascwine and 
nonmasculine morphemes (-ti-/-n-). The first 
group occurs as follows: -wa- occurs before a 
morpheme whose basic allomorph begins 
with a consonant, but not before (C)z or 
(y)o: ?akwAthe?tha? we (excl. pl.) are pound- 

2 In word-initial position this prefix is homony- 
mous with the inclusive dual (9.4). 
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ing (-thé?t-), twa: nékezh let’s (incl. pl.) take 
a drink! (-'[h/:|neké-); -wee- occurs before a 
morpheme whose basic allomorph begins 
with (C)ze: sw#@:?theh (you [pl.]) climb! 
(-[C]#?the-) ; -we- occurs before a morpheme 
whose basic allomorph begins with (y)o or i, 
and in the latter case the i is lacking: ?ak- 
weokweh we (excl. pl.) have taken it for our 
own (-[ylokw-), swe?tyo? you (pl.) dwell 
(-i?tyo-); -w- occurs before a morpheme 
whose basic allomorph begins with a, e, or ¢, 
and before the morpheme -i- say: ?itwe? we 
(inel. pl.) are walking, ?iswih (you [pl.]) say 
it! (-i-); -y- occurs after the first and inclusive 
person morphemes before a morpheme whose 
basic allomorph begins with o or a: 
?akyo:he? we (excl. pl.) are alive, tyohe? we 
(incl. pl.) are alive; -o- occurs after the 
second person morpheme (j-) before a mor- 
pheme whose basic allomorph begins with o 
or 0: johe? you (pl.) are alive. 

9.9. After the masculine and nonmasculine 
morphemes the plural allomorphs occur as 
follows: -ti- occurs before a morpheme whose 
basic allomorph begins with a consonant: 
hatithe?tha? they (masc. pl.) are pounding, 
watithe?’tha? they (nonmasc. pl.) are pound- 
ing; -n- occurs before a morpheme whose 
basic allomorph begins with a vowel: heno:- 
tah they (mase. pl.) say it (-ato-; 3.14), weno:- 
toh they (nonmasc. pl.) say it. 

9.10. When the indicative morpheme pre- 
cedes a nonsingular subjective prefix, its 
word-initial allomorph is wa- before the ex- 
clusive person and masculine morphemes, 
?o?- before the nonmasculine, and ’e- before 
the inclusive and second persons. The opta- 
tive allomorph is ?ae- before the inclusive 
and second persons, ?aa- otherwise. The 
future allomorph is in all cases ?e-. 

9.11. After any modal prefix the mascu- 
line allomorph -’:- occurs in place of the 
word-initial h- (before the dual morpheme): 
waini:yet they (masc. du.) hit it, ?A:ni:yet 
they (masc. du.) might hit it, ?e:ni:yet they 
(masc. du.) will hit it. The initial masculine 
allomorphs ha- and he- (before the plural 
morpheme) occur without the initial h and 
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the immediately preceding vowel is basically 
strong (cf. 8.7): wa:ti:yet they (masc. pl.) hit 
it (with the masculine allomorph : by 3.19), 
waenoté?ko? they (masc. pl.) fled (-até?kw-), 
&:tityet they (masc. pl.) might hit it, ?aend- 
te?ko? they (masc. pl.) ought to flee, ?eati: yet 
they (masc. pl.) will hit it (3.14), ?¢:note?ko? 
they (masc. pl.) will flee. After the optative 
and future morphemes, but not the indica- 
tive, the exclusive person allomorph is -ya-, 
rather than the word-initial ?a- (9.3): ?a:- 
yakwa: yet we (excl. pl.) might hit it, ?eyak- 
wayet we (excl. pl.) will hit ct. With these 
exceptions, the same allomorphs occur after 
the modal prefix morphemes as in word- 
initial position. 

9.12. The following is a list of the non- 
singular subjective prefixes, exemplified both 
in word-initial position and after the indica- 
tive and future morphemes. The verb root is 
-ke- see in the iterative (-kéh) and punctual 
(-ké?) aspects. 

?akhni-/?akhn-/?akhny- (/-yakhni- etc.) 
we (excl. du.): ?akhni:keh we see it, wa?&kh- 
ni:ke? we saw it, ?eyAkhni: ke? we’ll see zt. 

?akwa-/? akwe-/? akwe-/? akw-/?aky- 
(/-yakwa- etc.) we (excl. pl.): ?akwa: keh we 
see it, wa?akwake? we saw it, ?eyakwake? 
we'll see it. 

hni-/hn-/hny- we (incl. du.): hni:keh we 
see it, ?ehni:ke? we saw it, ?ehni:ke? we'll 
see 2t. 

twa-/twe-/twe-/tw-/ty- we (incl. pl.): 
twa:keh we see it, ?etwa:ke? we saw it, 
?etwa:ke? we'll see it. 

sni-/sn-/sny- you (du.): sni:keh you see it, 
?esni:ke? you saw it, ?esni:ke? you'll see it. 

swa-/swee-/swe-/sw-/j- you (pl.): swa:keh 
you see it, ?eswa:ke? you saw it, ?eswa:ke? 
you'll see tt. 

hni-/hn-/hny- (/-’:ni- etc.) they (masc. 
du.): hni:keh they see it, wa:ni:ke? they saw 
it, ?e:ni:ke? they'll see tt. 

hati-/hen- (/-’ati- ete.) they (masc. pl.): 
hati:keh they see it, wa:ti:ke? they saw it, 
?eati: ke? they'll see it. 

khni-/khn-/khny- they (nonmasc. du.): 
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khni: keh they see it, ?o?khni:ke? they saw; 
?ekhni:ke? they'll see it. 

wati-/wen- they (nonmasc. pl.): watitke 
they see it, ?o?wati:ke? they saw it, ?ewoti: 
ke? they'll see it. 
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6.2). With the masculine and feminine mor 
phemes, however, a performer that is mas 
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understood: honyo:h he’s killed it, it’s kallel 
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these in the descriptive aspect the pronomi- 
nal constituent is usually translated as a 
person or thing described: hot&?oh he’s asleep 
(ité?-), honya:koh he’s married (-nyak-), 
hoyo:ti:h he’s smiling (-yo[:]ti-). 

10.3. With a verb stem in any other 
aspect, the pronominal constituent of the ob- 
jective prefix denotes simply the goal of what- 
ever is denoted by the stem, with either a 
neuter, or, under the conditions noted above, 
a masculine or feminine performer under- 
stood: sanyos it’s killing you, honyos it’s 
killing him, he’s killing him, konyos it’s killing 
her, she’s killing her. With stems that take 
oly the objective prefix, however, the pro- 
nominal constituent is usually translated as 
the performer: hota?was he sleeps (-ita?w-; 
§.3), honya:s he marries, sayo:tih (you) 
smile! 

10.4. With a noun stem, the pronominal 
constituent is modified by the objective 
morpheme to indicate in most cases the 
alienable possessor of whatever is denoted by 
the stem (cf. 7.2): ?akéohta? my interpreter 
(cf. kAohta? my ear), sa?dshe? your basket 
(-4sh[ee]-), hoeno? his song (-en-), koya:a? 
her bag (-yals]-; 5.4), ?0?no? zts arrow (-’?n-). 
Afew stems denoting body parts occur with 
an objective prefix: ?akyo:skwa? my rib 
(-yoskw-), ?akéthwehsa? my liver (-thwéhs-). 
The neuter objective prefix also occurs with 
many stems to indicate specific existence: 
*osno? the bark (cf. kasno? bark), and a num- 
ber of stems occur only with the neuter ob- 
jective form: ?onéo? corn (-né-). 

10.5. The pronominal morphemes of the 
singular objective prefix are identical with 
those of the singular subjective prefix; i.e. 
the first person (-k-/-ke-), second person (s-), 
masculine (h-), feminine (k-), and neuter 
(?-) morphemes. The pronominal constituent 
comes first unless it is the first person mor- 
pheme: s|a- second person|objective in sdihsa:s 
it’s looking for you, h\o- masculine|objective in 
héihsa:s it’s looking for him, but ?a|k- ob- 
| jective| first person in ?akihsa:s it’s looking for 
| me. 

10.6. The first person allomorphs are 
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-k-/-ke- after the objective, conditioned as 
described in 6.4. With the first person mor- 
pheme the objective allomorph is ?a- in 
word-initial position: ?aka:weh it belongs to 
me (-awe-), °akéthe?tah I’ve pounded it. 

10.7. After the second person morpheme 
(s-) the objective allomorph is -a- before a 
morpheme whose basic allomorph begins 
with a consonant, 0, or 0, but not before 
(C)e or (y)o: sayéhtah you’ve hit it, s&oseeoh 
youve sprinkled it (-os&-); -2- before a mor- 
pheme whose basic allomorph begins with 
(C)e: sé: ?the:h you’ve climbed it; -e- before 
a morpheme whose basic allomorph begins 
with (y)o or i, and in the latter case the i is 
lacking: seokweh you’ve taken it for your own, 
se: tah go to sleep! ; and -0- before a morpheme 
whose basic allomorph begins with a, e, or 
e: se?ah you have intended it (-e?-), sa: weh 
it belongs to you.4 

10.8. After the masculine (h-), feminine 
(k-), and neuter (?-) morphemes the ob- 
jective allomorph is -o- before a morpheme 
whose basic allomorph begins with a con- 
sonant, a, or i, and in the latter two cases 
the a or i is lacking: hoyéhtah he’s hit it, 
ko:weh it belongs to her, ?0tA?oh it’s asleep; 
-a- before a morpheme whose basic allomorph 
begins with or or 9: héoszoh he’s sprinkled it; 
and -aw- before a morpheme whose basic 
allomorph begins with e or e: hawe:?ah he 
has intended it. There are, however, some 
roots in ¢ before which -o- occurs, the e then 
being absent. Such roots will be listed with 
initial (e): -(e)ndhto-know (?akénohto? J 
know it, hondhto? he knows it. 

10.9. When a modal prefix is present, the 
word-initial indicative allomorph is ?- before 
the first person, ?e- before the second person, 
and wa- before the masculine, feminine, and 
neuter singular objective prefixes. The opta- 
tive allomorph is ?ae- before the second 
person singular objective prefix, ?aa- before 
the others. The future allomorph is ?¢- before 
all of them. 

10.10. After the indicative prefix (?-) the 


4 See Seneca Morphology II, fn. 11. 
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objective allomorph before the first person 
morpheme is -0-: ?okyet zt hit me; after the 
optative and future it is -wa-: ?a:wakyet 7t 
might hit me, ?ewo:kyet zt will hit me (3.14). 
The second person allomorph remains s- in 
the singular objective prefix after any modal 
prefix (cf. 8.7): ?esa: yet zt hit you, ?aesayet 
at might hit you, ?esa: yet it will hit you. After 
any modal prefix the h of the masculine 
allomorph is lacking and the preceding vowel 
is basically strong: wa:oyet it hit him, 
?&:oyet it might hit him, ?e:oyet zt will hit 
him (3.18). After the indicative morpheme 
(wa-) the feminine allomorph is -?ak- and the 
neuter -?-: wa?ako:yet it hit her, wa?o:yet 
it hit it. After the optative (?aa-) and future 
(?e-) morphemes the feminine allomorph is 
-yak- and the neuter -y-: ?eyako:yet 7 will 
hit her, ?eyo: yet it will hit it. 

10.11. As noted in 10.1, the membership of 
the pronominal constituent in the nonsingu- 
lar objective prefix is not identical with that 
of the nonsingular subjective prefix. For one 
thing, the objective prefix makes no distinc- 
tion between exclusive and inclusive persons, 
but combines their meanings by prefixing the 
objective morpheme (?9-) directly to the 
construction first person|number morpheme 
(dual or plural): ?okhnfyetha? 7t’s hitting us 
(du.), ?okwayetha? 7t’s hitting us (pl.). 

10.12. The objective morpheme occurs in 
a zero allomorph in the second person non- 
singular objective prefix: sniyétha? it’s hit- 
ting you (du.), swayétha? it’s hitting you 
(pl.).° 

10.13. In the masculine and nonmasculine 
nonsingular objective prefixes no distinction 
is made between dual and plural; i.e. the dual 
and plural morphemes do not occur, but 
their meanings are combined in the Non- 
SINGULAR morpheme (-ti-/-n-) two or more: 
hotiyetha? it’s hitting them (masc.), oti- 
yetha? it’s hitting them (nonmasc.). Here the 
objective morpheme (-o-) occurs between the 
masculine (h-) or nonmasculine (?-) mor- 
pheme and the nonsingular. The latter occurs 


5 These forms are thus homonymous with the 
second person nonsingular subjective prefix (9.5). 
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in the allomorph -ti- before a morphen: 
whose basic allomorph begins with a con. 
sonant, and -n- before a morpheme whoy 
basic allomorph begins with a vowel (¢f. 
9.9): hotithe? toh they (masc.) have pounded it 
?otithe?toh they (nonmasc.) have poundel 
it, hondte?kweh they (masc.) have flei, 
?ondte?kweh they (nonmasc.) have fled. 

10.14. When the indicative morpheme 
precedes a nonsingular objective prefix, its 
word-initial allomorph is wa- before the ob. 
jective, masculine, and nonmasculine mor. 
phemes and ?e- before the second person. 
The optative allomorph is ?ae- before the 
second person, ?aa- elsewhere. The future 
allomorph is ?¢e-. 

10.15. After any modal prefix the mascv- 
line allomorph h- is lacking and the immedi- 
ately preceding vowel is basically strong: 
waoti:yet it hit them (masc.), ?doti:yet i 
might hit them (masc.), ?eoti:yet zt will hit 
them (masc.) (3.16). After the optative and 
future morphemes, but not the indicative, 
the objective allomorph is -yo- rather than 
the word-initial ?9-, and the nonmasculine 
allomorph is -y- rather than ?-: ?a: yokwa:- 
yet tt might hit us (pl.), ?eyokwayet at will hit 
us (pl.), ?a:yoti:yet at might hit them (non- 
masc.), ?eyoti: yet it will hit them (nonmasc.), 
With these exceptions, the same allomorphs 
occur after the modal prefix morphemes as 
in word-initial position. 

10.16. The following is a list of the singu- 
lar and nonsingular objective prefixes: 

?ak-/?ake- (/-ok- ete. and -wak- etc.) me: 


?ake:keh it sees me, ?oke:ke? it saw me,}' 


?ewoke: ke? zt will see me. 


sa-/se-/se-/s- you: satkeh it sees you,}' 


?esa:ke? it saw you, ?esa:ke? zt will see you. 


ho-/ha-/haw- (/-'o- etc.) him: ho:keh i}: 
sees him, wa:oke? it saw him, ?e:oke? it will} i 


see him. 

ko-/ka-/kaw- (/-?ako- ete. and -yako- etc.) 
her: ko: keh it sees her, wa?ako: ke? it saw her, 
eyako: ke? zt will see her. 

°9-/?a-/?aw- (/-yo- etc.) it: ?o: keh at sees 
it, wa?o: ke? tt saw it, ?eyo:ke? it will see tl. 

?okhni-/?okhn-/?okhny- (/-yokhni- ete.) 
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ys (du.): ?okhni: keh zt sees us, wa?dkhni: ke? 
itsaw us, ?eydkhni: ke? zt will see us. 

%9kwa - /?okwe - /?okwe - /?akw - /?aky - 
(/-yokwa- etc.) us (pl.): ?okwa:keh zt sees us, 
ya’okwake? it saw us, ?eyokwake? zt will 
we US. 

sni-/sn-/sny- you (du.): sni: keh tt sees you, 
esni: ke? tt saw you, ?esni:ke? it will see you. 

swa-/swe-/swe-/sw-/j- you (pl.): swa: keh 
itsees you, ?eswa:ke? it saw you, ?eswo:ke? 
twill see you (3.14). 

hoti-/hon- (/-’oti- etc.) them (masc.) : hoti:- 
keh it sees them, waoti:ke? zt saw them, 
eoti:ke? at will see them. 

oti-/?on- (/-yoti- etc.) them (nonmasc.): 
oti: keh zt sees them, wa? oti: ke? tt saw them, 
eyoti: ke? zt will see them. 


11.1. The pronominal prefixes so far de- 
sribed often have transitive meanings (or at 
kast translations), usually with either the 
performer or the goal neuter (6.2, 10.2), but 
smetimes with two masculines or two femi- 
tines (10.2). There is, however, also another 
expanded pronominal prefix, the TRANSITIVE 
PREFIX, in which each of the constituents 
contains a pronominal morpheme; one con- 
situent, the sussEcT, denotes the performer 
and the other, the opsect, denotes the goal 
of whatever is indicated by the verb stem. A 
transitive prefix almost never occurs with a 
noun stem.® The following diagram indicates 
the general structure of verbs containing this 
construction : 

















Be Verb 
Modal Transitive Prefix | Verb 
: Prefix Subject | Object Stem 





An example is s|k- second person|first person 
in skihsa:s you’re looking for me. The order 
of the constituents varies, depending on the 
particular morphemes involves, and in some 
tases the object contains the objective mor- 


§ An example of such an exceptional occurrence 
is shetwa- we (incl. pl.) . . . him in shetwéhse: no? 
he is famous or notorious among us, with the noun 
stem -hsend? name, fame. 


SENECA MORPHOLOGY III: EXPANDED PRONOMINAL PREFIXES 


229 


pheme, thus defining another construction 
subclass (2.15) of the objective prefix de- 
scribed in 10. In addition, certain of the 
transitive prefixes occur as constituents with 
the number morphemes, defining another 
subclass of the transitive construction. 

11.2. First to be considered are the transi- 
tive prefixes containing a first person subject 
and second person object or vice versa. The 
constituents are simply the first person 
(k-/-ke-) and second person (s-/ta-/-o[:]-/ 
-o[: ]y-) morphemes, and the subject precedes 
the object: koyétha? I’m hitting you, skye- 
tha? you’re hitting me. As subjects, the first 
and second person allomorphs are k- and s- 
respectively, except that in a word that ends 
in the imperative morpheme the second 
person allomorph is ta- before the first 
person: takoh give it to me! (-0-). As an ob- 
ject, the first person morpheme has the al- 
lomorphs -k-/-ke-, conditioned as described 
in 6.4. The second person morpheme has, as 
an object, the allomorphs -a(:)- and -a(:)y- 
(4.6), with -o(:)- occurring before a mor- 
pheme whose basic allomorph begins with a 
consonant or i (and the i is lacking), and 
-0(:)y- occurring before a morpheme whose 
basic allomorph begins with a vowel other 
than i: ko?tyo? I’ve seated you, koya:wi:h 
I’ve given it to you (-awi-). 

11.3. The two constitutes described above 
also occur followed by the dual and plural 
morphemes, in which case the second person 
allomorph is -0- as the object, and the al- 
lomorphs of the number morphemes are 
identical with those described for their occur- 
rence after the first person and inclusive 
person morphemes in 9.7-8. The grammati- 
cal meaning here is unique: the number mor- 
pheme sets a limit on the number of indi- 
viduals denoted by either of the constituents. 
With the dual morpheme either the subject 
or the object, or both, refers to a group of 
two, but neither can refer to a larger group: 
khniyétha? I’m hitting you (du.), we (du.) are 
hitting you (sg.), or we (du.) are hitting you 
(du.); skhniyétha? you (sg.) are hitting us 
(du.), you (du.) are hitting me, or you (du.) 
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are hitting us (du.). With the plural mor- 
pheme at least one of the constituents refers 
to a group of three or more: kwayétha? I’m 
hitting you (pl.), we (du.) are hitting you (pl.), 
we (pl.) are hitting you (sg.), we (pl.) are hit- 
ting you (du.), or we (pl.) are hitting you (pl.); 
skwayétha? you (sg.) are hitting us (pl.), you 
(du.) are hitting us (pl.), you (pl.) are hitiing 
me, you (pl.) are hitting us (du.), or you (pl.) 
are hitting us (pl.). 

11.4. The masculine morpheme (h-/-0-/ 
sh-/she-/he-) occurs as the first constituent 
in the transitive prefix before any of the sub- 
jective or objective morphemes or construc- 
tions (6, 8, 9) that contain the first, inclusive, 
or second person morphemes, as well as be- 
fore the UNRESTRICTED THIRD PERSON mor- 
pheme (-owo-/-owee-/-owe-/-awoy-/-ow-)’ he, 
she, they, the feminine singular objective con- 
struction, and the NONSINGULAR THIRD PER- 
son objective construction. If the second 
constituent is objective (contains the objec- 
tive morpheme), it is the object; if subjective, 
it is the subject and the masculine morpheme 
is the object. But the masculine morpheme, 
whether subject or object, always comes 
first. 

11.5. Before the first person singular ob- 
ject the masculine allomorph is h- and the 
objective allomorph -a- (cf. 10.6): hakyétha? 
he’s hitting me. Before the second person 
singular object the masculine morpheme has 
a zero allomorph in word-initial position and 
the second person allomorph is -y- (cf. 10.7): 
yayétha? he’s hitting you. Before the first 
person dual and plural objects the masculine 
allomorph is sh- and the objective allomorph 
-o- (ef. 10.11): shokhniyetha? he’s hitting us 
(du.), shokwdyetha? he’s hitting us (pl.). Be- 
fore the second person dual and plural objects 
(10.12) the masculine allomorph is she-: shes- 
niyetha? he’s hitting you (du.), sheswayetha? 
he’s hitting you (pl.). The maculine allomorph 
sh- also occurs before the feminine singular 
object, where the feminine allomorph is -ak- 
(cf. 10.8): shakéyetha? he’s hitting her. The 


7 Some idiolects have -awo- etc. 
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nonsingular third person objective construc. 
tion consists of the NONSINGULAR THIR) 
PERSON morpheme (-ak-) they, followed by 
the objective morpheme -0(:)-/-0(:)y-, ho. 
monymous with and conditioned in the sam 
way as the second person allomorphs de. 
scribed in 11.2. It is preceded by the mascv. 
line allomorph h-: hakbyetha? he’s hittin 
them, hakoyawi:h he’s given it to them. 

11.6. Before the first person singular sub. 
ject the masculine (object) allomorph is h- 
the first person allomorphs are -e-/-ey-, with 
-e- occurring before a morpheme whose basic 
allomorph begins with a consonant or i (and 
the i is lacking), and -ey- occurring before: 
vowel other than i: heyétha? I’m hitting him, 
he?tyo? I’ve seated him, heya: with I’ve giver 
it to him. Before the second person singular 
subject the masculine allomorph is he-, the 
second person allomorph -’hs-/-’hse-, con- 
ditioned as described in 6.5, but with -'hs. 
and -’hse- in place of s- and se- and with 
-‘hs- rather than j- before i or y: hehsa: wi:h 
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you’ve given it to him, hehsi?tyo? you’ve seated 


y occur 


him. Before the exclusive dual and plural} |omorpl 


subjects the masculine allomorph is sh- and 
the exclusive allomorph -a- (cf. 9.3): shakh- 
niyetha? we (excl. du.) are hitting him, shak- 
wayetha? we (excl. pl.) are hitting him. Before 
the inclusive and second person dual and 
plural subjects (9.4-5) the masculine al- 
lomorph is she-: shehniyetha? we (incl. du.) 
are hitting him, shetwdyetha? we (incl. pl.) 
are hitting him, shesniyetha? you (du.) are 
hitting him, sheswayetha? you (pl.) are hittin 
him. Before the unrestricted third pesron 
subject the masculine allomorph is h-; the 
UNRESTRICTED THIRD PERSON morpheme has 
the allomorphs -owo-/-owse-/-owe-/-owoy-/ 
-ow-, with -owo- occurring before a mor- 
pheme whose basic allomorph begins with a 
consonant other than (C)z or (y)o, -ows- oc 
curring before (C)x, -owe- before (y)o andi 
(and the i is lacking), -owoy- before o or 9, 
and -ow- before a, e, ore: howdyetha? he, she, 
they are hitting him, howé?tyo? he, she, they 


8 The last two forms are homonymous with 
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their reverse, he... you (du. or pl.) (11.5). 
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11.7. Another morpheme which occurs 
oily in the transitive prefix is the RESTRICTED 
THIRD PERSON (?-/?e-/-he-/-hey-) she, they, 
excluding the masculine singular meaning 
but including the masculine nonsingular. 
This morpheme occurs in a way that is simi- 
lar but not identical to the occurrence of the 
masculine morpheme described in 11.4-6. 

11.8. As a subject it precedes the first and 
gcond person nonsingular objective con- 
struction (10.11-12), which, however, occur 


or i (and) only with the nonsingular number morpheme 


before a 
ting him, 
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 he-, the 
se-, con 
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and with 


(-hi-/-hiy-/-i-/-iy-) rather than with the dual 
and plural. Its allomorph is ?- before the 
objective morpheme (-9-) that precedes the 
frst person, and ?e- before the second person 
morpheme (here -ts-), which conditions a 
wro allomorph of the objective. The non- 
singular allomorphs -hi-/-hiy- occur after the 
first person morpheme, and -i-/-iy- occur 


hsa : wi:h} after the second person. The forms without 
’ve seatei} y occur before a morpheme whose basic al- 
id plural lomorph begins with a consonant or i (and 
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the i is lacking); the forms with y before a 
vowel other than i. Examples are ?okhiye- 
tha? she, they are hitting us, ?okhiyawi:h she, 
they have given it to us, ?etsiyetha? she, they 
are hitting you (nonsg.), ?etsiyawi:h she, they 
have given it to you (nonsg.). 

11.9. As an object, unlike the masculine 
morpheme, the restricted third person FoL- 
Lows the first (k-) and second (s-) person 
subjects. In this environment the restricted 
third person allomorphs are -he-/-hey-, the 
former occurring before a morpheme whose 
basic allomorph begins with a consonant or i 
(and the i is lacking), the latter before a 
vowel other than i: kheyétha? I’m hitting her, 
them, kheya:with I’ve given it to her, them, 
sheyétha? you’re hitting her, them, sheya:- 
with you’ve given it to her, them. 

11.10. The restricted third person object 
does, however, precede its other subjects, the 
exclusive, inclusive, and second person non- 
singular, which (as in 11.8) occur with the 
nonsingular morpheme rather than the dual 
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and plural. The nonsingular allomorphs are 
-hi-/-hiy-/-i-/-iy-, conditioned as in 11.8, 
with -hi-/-hiy- occurring also after the in- 
clusive person morpheme. Before the ex- 
clusive person nonsingular subject the re- 
stricted third person allomorph is ?- and the 
exclusive person allomorph -a-: ?akhiyetha? 
we (excl.) are hitting her, them. Before the 
inclusive person nonsingular subject the re- 
stricted third person allomorph is ?e-; the 
inclusive person allomorph is -t-: ?ethiyetha? 
we (incl.) are hitting her, them. Before the 
second person nonsingular subject the re- 
stricted third person allomorph is also ?e-; 
the second person allomorph is -ts-: ?etsfye- 
tha? you (nonsg.) are hitting her, them.® 

11.11. It may have been noticed that the 
restricted third person morpheme does not 
occur as subject with a first or second person 
singular object. Instead, its meanings are 
divided between two other morphemes: the 
feminine (?-/?e-),!° already familiar, and a 
new one, the NONSINGULAR THIRD PERSON 
(h-/ho-) they. The first and second person ob- 
jective construction follows the subjects, and 
occurs in the allomorphs already described 
(10.6-7), but the objective allomorph is -o- 
before the first person. The feminine and non- 
singular third person allomorphs are ?- and 
h- respectively before the first person object, 
?e- and ho- before the second person: ?okyé- 
tha? she’s hitting me, hokyétha? they’re hitting 
me, °esdyetha? she’s hitting you (sg.), hosé- 
yetha? they’re hitting you (sg.). 

11.12. There is also a transitive constitute 
consisting of a neuter subject and neuter 
object. As the first constituent, the neuter 
morpheme has the shape k-; as the second, 
the shape -ow9-/-owe-/-owe-/-owoy-/-ow-, 
homonymous with and conditioned in the 
same way as the unrestricted third person 
morpheme described in 11.6: kow5yetha? 7t’s 
hitting it, kowo: with 7t has given it to tt. 

11.13. Finally, there are three other mem- 
bers of the transitive construction which 

® This form is homonymous with its reverse, 


she, they... you (nonsg.) (11.8). 
10 Some idiolects have ?o- rather than ?e-. 
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occur as constituents with a number mor- 
pheme (cf. 11.3), in this case the nonsingular 
morpheme (-ti-/-n-). The transitive con- 
stituents are sh|ako- he|her (11.5) and two 
others not yet encountered. These have the 
masculine (h-) and nonmasculine (k-) mor- 
phemes as objects, followed by the restricted 
third person morpheme (-awoa-/-owe-) as sub- 
ject. The nonsingular morpheme in all three 
cases applies semantically to the first of the 
two constituents, whether it is subject (as 
with shako-) or object (as with howo-/howe- 
and kowo-/kowe-). The nonsingular al- 
lomorphs -ti- and -n- are conditioned as de- 
scribed in 10.13. The restricted third person 
allomorph -owo- occurs before -ti-, -owe- be- 
fore -n-. Examples are shakotiyétha? they 
(masc.) see her, shakonowi:h they (masc.) 
have given it to her, howoatiyétha? she, they are 
hitting them (masc.), howenowi:h she, they 
have given it to them (masc.), kowotiyétha? 
she, they are hitting them (nonmasc.), koweno- 
wi:h she, they have given it to them (nonmasc.). 

11.14. The word-initial indicative allo- 
morph is ?0?- before any transitive prefix 
beginning with k or s. It is wa- before any 
other. The optative and future allomorphs 
are ?aa- and e- respectively before any 
transitive prefix. 

11.15. After any modal prefix, any transi- 
tive prefix beginning with h in word-initial 
position followed by a vowel occurs without 
the h, but with the preceding vowel basically 
strong: waosayet they hit you, ?4osa:yet they 
might hit you, ?eosayet they will hit you. Also 
after any modal prefix, the masculine allo- 
morph is -’:- rather than -0- before the 
second person singular object (cf. 11.5): wa:- 
ya:yet he hit you, A: yayet he might hit you, 
?e:ya:yet he'll hit you. After the optative 
and future morphemes, but not the indica- 
tive, any transitive prefix beginning with ? 
occurs with y in place of the ?: ?4:yokhi: yet 
she, they might hit me, ?eydkhi:yet she, they 
will hit me. With these exceptions, the same 
allomorphs occur after the modal prefix mor- 
phemes as in word-initial position. 
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11.16. The following is a list of the trang. 
tive prefixes: 

ko(:)-/ko(:)y-I... you: ko:keh I see you, 
°o?ko:ke? J saw you, ?eko:ke? I’ll see you, 

sk-/ske-/tak-/take- you... me: sketkeh 
yousee me, ?o0?ske: ke? yousaw me, ?eske tke! 
you'll see me. 

khni-/khni/khny- J, we...you (du) 
(11.3): khni:keh J, see we you, ?o?khni:ke’ 
I, we saw you, ?ekhni:ke? J, we will see you, 

kwa-/kwee-/kwe-/kw-/ky- I, we... you 
(pl.): kwa:keh I, we see you, ?0?kwa:ke? | 
we saw you, °ekwa:ke? I, we will see you. 

skhni-/skhn-/skhny- you . . . me, us (du,): 
skhni:keh you see me, us, ?0?skhni:ke? you 
saw me, us, ?eskhni:ke? you'll see me, us. | 

skwa-/skwe-/skwe-/skw-/sky- you. ..| 
me, us (pl.): skwa:keh you see me, w, 
?o?skwa:ke? you saw me, us, ?eskwa:ke’ 
you'll see me, us. 

hak-/hake- (/-’ak- etc.) he... me: hake: 
keh he sees me, wa:ke:ke? he saw me, ?eoke:- 
ke? he'll see me. 

ya-/ye-/ye—/y- (/-':ya- ete.) he... you: 
ya:keh he sees you, wa:ya:ke? he saw you, 
°er:ya:ke? he'll see you. 

shokhni - /shokhn -/shokhny- he... 
(du.): shokhni: keh he sees us, ?0?shdkhni: ke’ 
he saw us, ?eshdkhni:ke? he’ll see us. 

shokwa - /shokwe - /shokwe - /shokw - 
shoky- he... us (pl.): shokwa:keh he sees us, 
?o?shokwake? he saw us, ?eshokwake? he'll 
see us. 

shesni-/shesn-/shesny- he... you (du.): 
shesni:keh he sees you, ?0?shéshni:ke? hy 
saw you, ?eshéshni:ke? he’ll see you. 

sheswa - /sheswe- /sheswe - /shesw - /shej- 
he... you (pl.): sheswa:keh he sess you, 
?o?shéswake? he saw you, ?eshéswake? he'll 
see you. 








shako-/shaka-/shakaw- he... her: shako:-| ; 


keh he sees her, ?o0?shako:ke? he saw her, 
?eshako: ke? he’ll see her. 
hako(:)-/hako(:)y- (/-’ako[:]- ete.) hel 
... them: hako:keh he sees them, wa:ko:ke’ 
he saw them, ?eoko: ke? he’ll see them. 
he-/hey- (/-’e- etc.) I... him: he:keh | 
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see him, wa:eke? I saw him, ?e:eke? or 
Ye:eke? I’ll see him (3.16). 

héhs-/héhse- (/-’éhs- ete.) you... him: 
hehse:keh you see him, waehse:ke? you saw 
him, ?6:hse:ke? yow’ll see him. 

shakhni-/shakhn-/shakhny- we (excl. du.) 
_.. him: shakhni: keh weseehim, ?0?shakhni:- 
ke? we saw him, ?eshdkhni:ke? we’ll see him. 

shakwa - /shakwe - /shakwe - /shakw - / 
shaky- we (excl. pl.) ... him: shakwa:keh we 
see him, ?o0?shakwake? we saw him, ?eshak- 
wake? we'll see him. 

shehni-/shehn-/shehny- we (incl. du.) ... 
him: shehni:keh we see him, ?0?shéhni: ke? 
we saw him, ?eshéhni:ke? we'll see him. 

shetwa - /shetwe - /shetwe - /shetw - / 
shety- we (incl. pl.) ... him: shetwa:keh we 
see him, ?o?shetwake? we saw him, ?eshet- 
wake? we'll see him. 

shesni-/shesn-/shesny- you (du.)... him: 
shesni:keh you see him, ?o?shésni:ke? you 
saw him, ?eshésni:ke? yow’ll see him. 

sheswa- / sheswe- / sheswe- / shesw- / 
shej- you (pl.)... him: sheswa:keh you see 
him, ?o?shéswake? you saw him, ?eshéswake? 
you'll see him. 

howo- / howee- / howe- / howoy- / how- 
(/-'owo- ete.) he, she, they . . . him: howo:keh 
he, she, they see him, waowoke? he, she, they 
saw him, ?eowoke? he, she, they will see him. 

%9khi-/?okhiy- (/-yokhi- ete.) she, they 
... Us: 29okhi: keh she, they seeus, wa? dkhi: ke? 
she, they saw us, ?eydkhi:ke? she, they will 
see US. 

*etsi-/?etsiy- (/-yetsi- etc.) she, they... 


Ff 


| you (nonsg.): ?etsi:keh she, they see you, 


waétsi:ke? she, they saw you, ?eyétsi: ke? 
she, they will see you. 
khe-/khey- I...her, them: khe:keh I 


| see her, them, ?0?khe:ke? I saw her, them, 


: shako:- 
saw her, 


etc.) he 
a:ko:ke?} 
l 


he:keh | 


*ekhe: ke? I’ll see her, them. 


SENECA MORPHOLOGY III: EXPANDED PRONOMINAL PREFIXES 


233 


she-/shey- you...her, them: she:keh 
you see her, thei, ?0?she:ke? you saw her, 
them, °eshe:ke? you'll see her, them. 

?akhi-/?akhiy- (/-yakhi- ete.) we (excl.) 
... her, them: ?akhi:keh we see her, them, 
wa?dkhi:ke? we saw her, them, ?ey&khi: ke? 
we'll see her, them. 

ethi-/’ethiy- (/-yethi- etc.) we (inel.) 
... her, them: ?ethi:keh we see her, them, 
waéthi:ke? we saw her, them, ?eyéthi:ke? 
we'll see her, them. 

?etsi-/?etsiy- (/-yetsi- etc.) you (nonsg.) 
... her, them: ?etsi:keh you see her, them, 
waétsi:ke? you saw her, them, ?eyétsi: ke? 
you'll see her, them. 

?ok-/?ake- (/-yok- ete.) she...me: 
?oke:keh she sees me, wa?oke:ke? she saw 
me, ?eyoke:ke? she’ll see me. 

hok-/hoke- (/-’ok- etc.) they... me: 
hoke: keh they see me, waoke: ke? they saw me, 
?eake:ke? they'll see me. 

?esa-/?esee-/?ese-/?es- (/-yesa- ete.) she 
... you: ?esa:keh she sees you, wa?esake? 
she saw you, ?eyesake? she'll see you. 

hosa-/hosee-/hose-/hos- (/-’osa- ete.) they 
... you: hosa:keh they see you, waosake? 
they saw you, ?eosake? they’ll see you. 

kowo-/kowee-/kowe-/kowoy-/kow- it... 
it: kowo:keh it sees it, ?0?kowoke? 7t saw it, 
?ekowoke? 7t will see it. 

shakoti-/shakon- they (masc.)... her: 
shakoti: keh they see her, ?0?shakoti:ke? they 
saw her, ?eshakoti:ke? they'll see her. 

howoti-/howen- (/-’owoti- etc.) she, they 
... them (masc.): howoti:keh she, they see 
them, waowoti:ke? she, they saw them, 
?eowoti: ke? she, they will see them. 

kowoti-/kowen- she, they...them (non- 
masc.): kowoti:keh she, they see them, 
?o?kowoti:ke? she, they saw _ them, 
?ekowoti:ke? she, they will see them. 
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2. List of cognates, (Cont.), M-Y 
Single consonant correspondences 
The ‘intrusive’ nasal 

Consonant cluster correspondences 


gu wm go po 


2.2. MALE. PCA *na‘pe'w-a (Bl); Nt. 
nampvs (nompaas male of the human 
species). 

MAN. PCA *elenyiwa (BI) felenyiwaki, 
px.; Nt /ninwak/ (ninnuog men). It is not 
clear from Trumbull’s dictionary whether 
this is a Natick or Narragansett form 
(ninnuog) is listed in the English-Natick 
entries, but in the Natick-English part we 
find only (nnin) man which is starred, a de- 
vice used to indicate that the form ‘does not 
belong to the Natick dialect proper’. Under 
this starred form, which is Narragansett, is 
listed a plural (nninnuog, ninnuock). 

MEDICINE. See Roor. 

MEET. PCA *naékwe'skawe'wa (*naékw- 
+ -e- + -eSk-aw- by foot/body movement) he 
goes so as to correspond with the other (Bl); Nt 
nakeskaw (nogkushkau, nogkishkau he meets 
him). 

MIXED TOGETHER. PCA *kelek- (H); Nt 
kinak- (kenukshau it is mingled or mixed 
with, nuk-kinukkinum J mingle or mix). 

MOUTH: HIS ---. PCA *weto'n-i (M); Nt 
/woatun/ (wuttoon). 

Move. PCA *ma‘t-, *marcyithkani (F 
ma‘tithkani thous mightest move, C marti: 
hkan do thou later go on a hunt, O ma‘tirkkan 
do thou later go away) (Bl); Nt /meman- 
tenoam/ (mamontunnum he moves/puts it in 
motion), /memantit/ (mamonchit when he 
moves, cf. monchanaii he moves him). Cp. Nr 
matichase Be my guide!. 

MUCH. See BIG (2). 


* For Part I see IJAL 26.112-119 (1960). 


NAIL. PCA *-Skan8-a (Ha) (K.Ha neska&, 
C.L n’iskasiy, O.Ba nishkanj my nail, § 
-kaSe, -?sSkaSa); PCEA *-xkang-a (Ha) (D) 
nhikaS my nail); Nt /motkas/ (mitihke 
mtihkas the nail of a man, the hoof, claw « 





No. 3 


lematics 
PCA *? 
PLAY. 
(Bl); } 
plays, « 
puhptio 
PLUR. 
-ak/ ( 
POT. 
/abko®k 
earthen 





talon of an animal, cf. wuhkossog, wuhkassy 
his nails). 


NAME (vb). PCA *winle-wa (Bl) he name yr 


him, *wintamwa (Bl) he names it, tells it 
Nt /esuwinaw/ (ussowenau he names hin) 
/asuwitem/ (ussowetam he calls or names it) 
/asuwitemenvt/ (ussoowetamunat to give 
name to inanimate object). 

NEGATIVE. PCA *kaé- (or *kan-) (H)#! 
Nt /-v'kan/ the 2nd sg. of the imperativ 
negative is formed in (-Ahkon or -uhkon), ¢ 
(kummootuhkon) thou shalt not steal. 

NET. PCA *a?lapya (Bl); Nt /(h)esep 
(hash4b, hash4bp a net for fish, Ashéap, 
AshAppog vegetal fiber or fibrous material usel 
for making thread or cord). For corre. 
spondence of PCA *?1: Nt/S/ep. FLY. 

NEW. PCA *weski (Bl); Nt /waski 
(wuske, weske young, new). 


OLD PERSON. PCA *ke?tesiwa (H); 
/kebéisw-it/ (kuhchisu-it when he is old, ¢. 
kehchisqua, kutchisqua an old woman, 
kehchiog, kutchiog the old [collectively]). 

oNE. PCA *nekot- (Bl); Nt /nikwot) 
(nequt). 

oTHER. PCA *kotak- (H); Nt /anketak, 
(onkatog another person, cf. onkatogig 
others). 


PLACE: HE ---S HIM. PCA *a?le'wa (B)); 
Nt /panaw/ (ponau). This set is to be ques 


POUN: 
(C.L ta 
fe 
in a 
Nt /tak 
he grind 

PRICK 
(koo, kc 
thorn, b 
Op. Pn 

PRON 
in/, ¢ 
pers. pr 
wa ye; 
kenauar 
(neenay 

PRON 
*ne- (EF 
SISTER 

PRON( 
PCA *k 
Bl); N 

PRON( 
*ye-, *) 
HEART, 

PRON‘ 
DETERM 
(nu-), ¢ 

PUT I 
/pitaw/ 

RED. . 


= 





tioned because of the unexplained initial 


consonant in Nt. For discussion of the prob- 


(nusqui 
RELA‘ 
I have 1 


19 Hockett; ibid, ‘“M has the negator with tht, ( 


shapes /kan/ and /kat/; it is this alternation thal 


points to *ka@- rather than *kan-.’’ 
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h nite 
D! ntal 
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\N matical correspondence of Nt /n/ with rock. PCA, PCEA *-a:peék-w- (BI) 
=e PCA *?] see FLY. stone, metal; Nt /-pisk/ (-pisk rock in com- 
PLAY. PCA *pa-hp-, *pa‘hpiwa he laughs pound words, cf. chippisk, chepisk rock, 
(Bl); Nt /pampu/ (pompu, pohmpu he crag), /-ampskw/ (-ompsk, ompsq_ rock 
plays, cf. pompuog, pohmpuog they play, in compound words, ef. kenompsq sharp 
puhptiog they play). stone). 
neskay| PLURAL, ANIMATE. PCA *-aki (Bl); Nt ROLL (vb.). PCA, PCEA *te-tep- (Ha); 
neia /ak/ (-og), ef. SNAKE. Nt /tatep.../ (tatuppequanum he rolls it 
Ha) (Dp) 207: PCA *aékehkw-a (Bl) kettle; Nt along, tattuppunau it is twisted, tuttuppun 
mth fa*ko"k, abko*kw-/ (ohkuk, ohkuhk, ahkuhq spun, twisted). 
lan earthen pot or vessel, ahkuhquog pl.). ROOT, MEDICINE. PCA, PCEA *we-ce: 
thkassy pounD. PCA, PCEA *takw- (?; Ha) pihk- (?; Ha); Nt /wotape*k/ (wadchabuk, 
((.L takkwahwew piler, Sh takhw- pound, wutchaubuk, -adch4buk [in compounds], 
hs nl grind for bread, Mi tahkwahikwani I grind wutchappehk root). The *hk is questioned 
tells it itin a mill, Me tekwtaaga to grind grain); by Haas since the Cree form shows no 
nce hee Mt /takehusu/ (togkuhwhosu, togguhwhosu evidence for it: C.W ochapik, eO uci-pikk, 
ened he grinds in a mill). Sh hoée'pi?k-i. It is likewise questionable 
to give « PRICKLY. PCA *ka:w- (H); Nt /kuwa/ for PCEA since, even though the Natick 
(ko, koowa, koowds pine tree, cf. kéus a form may be evidence for an *hk, the Me 
-) ae a en ea dimin. ste ?}). oochebusk points to *6k. 

. 1(p. Pn.Ha kd-wi quill (porcupine). f 5 AS, 
i PRONOMINAL BASE. PCA *-i'l- (Bl); Nt Prac ge panting po (H); wi raga 
ad | /in/, ef. PCA *ni‘la; Nt /nin/ (neen, nen) vil - of B “ di wt sis wt oa ark; 
/(h)exep pers. pron., 1st 8g. PCA *ki'la thou, *ki'lwa: a ma ae ree ° u ’ bye 
4 ashiy va ye; Nt /kinaw, kinawaw/ (kenau, jo 1es ith cn sahaatinnataaeed for 
rial use nauau you). PCA *nirlana; Nt /ninawon/ Scrape and Dry.” Cf. also F.H kahkah- 
ae ueenawun) pers. pron. we (excl.). kya‘wi . is rough Cocke the set under 
ay PRONOMINAL PREFIX (FIRST PERSON). PCA Hockett’s reconstruction kaHk- (*H: *h 

wash ‘ne- (BI); Nt /no-/ (nu-), ef. RELATED, OF *@) sharp and Nt (koshkeyéue) roughly 
‘\ssrer (1). (a coat not soft). 
PRONOMINAL PREFIX (SECOND PERSON). sea. PCA *ke?tikami (H); Nt (kehtoh- 
(H); N}PCA *ke- (Bl); Nt /ka-/ (ku-), e.g. *kete-hi han, kehtahhan any sea, kehtoh, keihtoh 
- old, ch Bl); Nt /kote(-)/ (kuttah) thy heart. the ocean). Cp. Nr kitthan, wechékum the 
womali! PRONOMINAL PREFIX (THIRD PERSON). PCA sea. SEE BIG (2). 
vely |). |tve-, *w- (BI); Nt /wo-, w-/ (we-, w-), ef. SEE: HE ---S HIM. PCA *ne-we-wa (BI); 
/nikwot|fgarT, LEG, TONGUE, TOOTH. Nt /nvw/ (ndaw). 
PRONOMINAL PREFIX (THIRD PERSON, IN- seep. PCA *-6kan (Ha), *min- (Ha), 
— DETERMINATE). _— 7 (BI); Nt /ma-/  *mi-nali (B1) berries; Nt /skenimanes/ 
Onkalvoglimu-), cf. THROAT, KNEE. esp (skannémunash seeds, seed corn), /-mini/ 

. laae sane ss ots bom within; Nt (-minne berry in compound names, cf. 
-wa, (B])) pitaw au ts 4 , ; 5 ; 

‘ be val RED. PCA *meskw- (Bl); Nt /mo'| Skwi/ nee Se 1 oe 
ed initial(musqui, mishque, msqui 7t 7s red). pitas 


the prob] RELATED. PCA *netefa‘nkoma‘waki (G) 


| have them as relatives; Nt /notinankemo 


or with thtwak/ (nuttinonkumdéog my kinsman). Cp. 


nation th 


"Sh nitela‘ko‘ma'ki I am related to them, 
Dl ntalanki‘ma J’m related to him. 


SHAKE. PCA *neki- (?; Ha) (Sh naneky-, 
neneki, shaking nervously); Nt /nokimu, 
nonokimu/ (nukkemoo, nunnukkimoo zi 


20 Haas, Mary R.; Notes on Some PCA Stems 
in /k-/, IJAL 24.241-245 (1958). 
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shakes, it is shaken). Natick, like Shawnee, 
shows reduplication. 

sHARP. PCA *kin-, *kirn-t (Ha), *kirn-l 
(H); Nt /ki*tetaw/ (kéhtadtau, kehtattau 
he sharpens), /kin-, kini-/ (kenai, kéneh 
it is sharp, kenompsk a sharp stone, keneh 
quog a sharp knife). 


sHor. PCA ‘*madkesini (Bl); PCEA 
maxkesini (S); Nt /makas, makesines/ 


(mokus, mokis shoe, mokussinash shoes). 

sHoot. PCA *pemwe'wa (Bl) he shoots 
him; Nt /pipamwaw/ (pepumwau he shoots 
at him). The Nt form shows reduplication. 

sHortT. PCA *tahkw- (Ha), ttahkwisiwa 
he is short; Nt /tia*kwsiw/ (tiohqoosue 
he is short, ef. tidqui it is short). Cp. Nr 
tiaquénqussu he is short. 


sIsTeR (1). PCA ‘*nemihsa (Bl) my 
elder sister; Nt /namisis/ (nummissis my 
sister). 

sisTeER (2). PCA *-ittekeHkw- (BI; 


*H: ?) sister (woman speaking), {ni‘teke- 
Hkwa my sister, twittekeHkwa her sister; 
Nt /niteke'|Skw/ (netukkusq my _ sister), 
/witak(i)*|Skwa/ (wetukishquoh, weetuks- 
quoh her sister). 

sit: He ---s. PCA *apiwa (Bl), fapiwaki 
they sit; Nt /epw/ (appu he sits), /epwak/ 
(appuog they svt). 

SLEEP wiTH. PCA *wi'épe-me'wa (Bl), 
PCEA *wi'xpe'me'wa he sleeps with him; 
Nt /we'pemaw/ (wehpamau he has sexual 
connection with her). 

SNAKE. PCA, PCEA ‘*aékorka, -aki pl. 
(S); Nt /eskuk, eskukak/ (askook, askookog 
pl.). 

snow (n.). PCA *kon (H); Nt /kun/ 
(koon). 

sNow: IT ---s. PCA *me@ponw-i, PCEA 
mexponw-i (S); Nt /mo*puwi/ (muhpoowi). 

sorTeN. PCA “*lo‘6k-ah- (G) soften by 
pounding; Nt /nubki/ (noohki adj. soft, 
ef. noohkik that which is made soft, 1.e. meal, 
flour, ground corn). Cp. Nr nékehick parched 
meal, Pn.Ha néhkiken it’s soft. 

SOILED. See EXCREMENT. 

so MANY. PCA “*tahé-, *tahSiwaki they 
(an.) are so many (Bl); Nt /tasv, ta'Siw/ 
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(tohsi, tohshéu how much), /ta*swak| (nunoo 
(tohsuog how many (an.), cf. tohsunasi ling). 
how many (inan.), tohshe how much)| sum 
Notice the recording in Natick of whaj (népur 
would seem to be /8/ before /i/ and /s}| swe 
elsewhere. If this alternation has an} (weekc 
reality, a problem to be solved by a mon} tastes g 
detailed synchronic examination of th} rast 
Natick corpus available, we have here aif Nt/sp< 
example of Natick reflecting the PC iste, f 
morphophonemic alternation between *(C}f pickq 
and *(C)S. TEAC 
SOUTH: IN THE ---. PCA “*Sa'wanonll yewa 
(Ha); Nt /sawenabki (sowanohke the soul Nt /kc 
country). of [me 
spit. PCA *sekw-, *sehkw-i (?; Ha)fkuhkoo 
Nt /satk..., sakw/ (suhkou he opi TELL 
so0kq, sohq spitile, suhkesu he is a spitte|(G); N 
sukkissu long clam, spitter). Cp. Sh 6ekn{Sh ha, 
spit, Cr(W) si’koo spit (v.7.). ota) t 
sporreD. PCA *mya‘l- (Bl); Nt /m) tac 
moni/ (mémone freckled). The Natick on @f); ] 
shows reduplication. kpakin 
sprout (vb). PCA ‘*sa‘kihédenwi (M} ran 
it sprouts; Nt /sanki... / (sonkin <t spring thinks 
up [as a plant)). (anante 
sTaAND. PCA *-ka:pawi- (Bl); Nt /kaij think). 
paw/ (kompau he stands erect [occurs onl} THRE 
in compounds], cf. sampookompau he stanif /nis/ ( 
upright). the thare 
star. PCA *afa‘nkwa (Bl); Nt /enakw} taro 
(anogqs). The final /s/ in Natick is prokani ( 
ably the diminutive suffix. mokw 
STEAL. PCA *kemo-twa (B1); he steals ifnit/ (k 
Nt /kamutv/ (kommooto, kummooto 4 Nr git 
steals). THUM 
STONE. PCA ‘*a?senyali (BI) — ston} /kettvk 
Nt /kwinesinonk (quinahsinnunk a pesllf uhquae 
i.e. long stone < qunni, quinne long, hassu cheash 
assin stone). TOR: 
stronG. PCA *sa‘pw- (G)_ strong; M /ketvk 
/sampwi/ (sampwi straight, right, juifuhquae 
Cp. Nr satimpi straight, Qu.T sompiiroor. 
it is right. This set is questionable because 
the possibly tenuous semantic relationshi 
streTcH. PCA *Si:pi (Ha); Nt /sipl 
(sepe it spreads out, extends). 
suckLE. PCA *no:n- (BI); Nt /nonul 





























TONG 
Nt /wi 

TOOT! 
/wipit/ 

TOP, 
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ta*swak}(nunoon J suck), /nunok/ (noonuk a suck- 
ohsunaslf ling). 

» much\} suMMER. PCA ni‘pin (Ha); Nt /nipon/ 
of whaj(népun the latter part of the summer). 
and /3| sweet. PCA *wirnk-an- (G); Nt /wikan/ 

has anj(weekon 2 is sweet). Cp. Sh wiikanwi 7 
y & mor tastes good. 

of thf taste. PCA, PCEA “*e@pokw- (Ha); 
> here al Nt/spo*kwat/ (spukquodt, spuhquodt the 
the PC taste, flavor of a thing). Cp. Nr teAqua as- 
een *(Cifpickquat what does it taste of. 

TEACH. PCA *keékinw-, *ke#kino-hama 
‘a wanonif we'wa (H) he teaches it to him (trans. an.); 
the soulfNt /ko*kin ... / (kuhkinneam he takes note 
of (mentally or by observation, v.t.], cf. 
(?; Ha\{kuhkootumwehteau he teaches [caus. inan.]). 
he spit} TELL NEWS: HE ---S. PCA *a:cimoke'wa 

a spittel (G); Nt /an¢imukaw/ (aunchemookau). Cp. 
Sh 6eki{Sh haatimo he talks, haat- (plus -i-m-, 
ota) talking, Mi atimokiani J boast. 

Nt /m} vuick: it is ---. PCA *keXpakya:wi 
atick fomf(H); Nt /kepaki/ (kuppogki). Cp. Sh 
kpakinwi zt cs thick (< pre-Sh *kV?pakinwi). 
nwi (Mj) rainx. PCA *-e'le- (Bl), te@e'letamwa he 
1 at spring thinks so (O inentam); Nt /enentem/ 

fandntam he thinks, cf. nuttenantam I 

Nt /kat} think). 
ccurs onl} THREE. PCA *ne?6-, *ne?@wi (Bl); Nt 
1 he stand /niS/ (nish three), niSwi (nishwe, nishweu 








the third). 
; /enakwi}] THROAT: SOMEONE’S ---. PCA *mekonta: 
‘k is pel an (M); *ko(-)nta‘kani (H) throat; Nt 


mekwotenk/ (mikquttunk), /kekwotenke 
he steals itnit/ (kuk-quttunkanit to thy throat). Cp. 
nmo0to Nr gittuck throat. 

THUMB. PCA *ke?cifencyi (H); Nt 
A /ketvkwenié/ (kehtequanitch [keht great, 
fuhquae end of, -nutch hand], kehtequanit- 
Hcheash pl.). SEE BIG (2), FINGER. 

TOE: BIG ---. PCA *ke?cisita (H); Nt 
strong; M /ketvkwesit/ (kehtequaseet: keht great, 
ight, juituhquae end of, -seet foot). See Bia (2) 











T sompifroor. 
> because} TONGUE: HIS ---. PCA “*we'@an-i (M); 
elationshij Nt /winen/ (wénan, weenan). 


Nt /si} roorn: nis ---. PCA *wicpit- (Bl); Nt 
/wipit/ (weepit). 


Nt /nenu} top, on. PCA, PCEA “*wadkici (S) on 
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top, on the surface; Nt /waskiti/ (woskeche 
on the surface of). 

TREE. PCA *me?tekw- (Bl); Nt /me> 
takw-/ (mehtug, mehtugq tree, mehtug- 
quash, matugquash trees); PCA *-a-htekw- 
(Bl); Nt /wehtetkw/ (wuttuhq, wuttuk, 
wuhtuk, bough, branch, ef. wuttoohqtinash 
wood pl.). 

Try. PCA *kot-, *koci (part.) as preverb 
try to, *kotihe-wa he tries, tests him, *koci? 
ta'wa he tries, tests it (H); Nt /kwatihaw/ 
(qutchehheau he tempts, proves him), kwati> 
taw (qutchehteau he proves it). 

two. PCA *ni‘8-, niSwi (Bl); Nt /nis, 
niswi/ (neese, neeswe). 

UNCOVER: HE ---S 1T. PCA. PCEA *pa: 
ékenamwa (8S); Nt /paskinom/ (poskinum 
he bares it). 

URINATE. PCA *Seki- (M); Nt /sekitak/ 
(noh ségketog, noh sagkeet he who urinates). 


view (vb.). PCA *wensa‘pame'wa (BI) 
he sees him from there; Nt /wasempvnet/ 
(wussampenat to view, look out from). 


WALK BY. PCA *pemohée-wa (M) he 
walks by; Nt /pam&aw/, (pomsheau, pom- 
shau it passes, goes by, cf. noh pomushadt, 
pamwushadt he who walks). 

watcH (vb.). PCA, PCEA *aék- (S) 
watch, lie in ambush; Nt /eskahom/ (askuh 
hum he watches, waits for it). 

waTeR. PCA *nepyi (Bl); Nt /napi/ 
(nippe, nuppe). 

wHiITE. PCA *wa:p- (Bl); Nt /wampi/ 
(wompi there is white). 

wHo. PCA *awen- (Bl); Nt /hawen/ 
(howan). 

wipow. PCA, PCEA ‘*si‘kawefkwe'wa 
(S); Nt /sikaw'|Skw/ (sekousq, sekauishq, 
sekousquaog pl.). See WOMAN. 

WINTER. PCA, PCEA *pepo'nw- (Ha); 
Nt /pvpvn/ (popén, pdépon 7 is winter). 

wito. PCA *wit-, *wié- (Bl); Nt 
/witamaw/ (weetomau she dwelt with her), 
witi (weeche with, in company with person 
or animate object). 

woman. PCA, PCEA *eékwe'wa, (B1); 
Nt /(e)*|8kw, eskwes/ (squa, eshqua female, 
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squéas, squaus a woman with termination 
denoting a living creature). 
woopcuuck. PCA, PCEA *akwaékwa 
(S); Nt /akaSkwak/ (ogkoshquog conies). 
WORK: HE ---S. PCA *afoékye'wa, PCEA 
*afoxkye'wa (S); Nt /enekawsu/ (ana- 
kausu, cf. anakausish labor thou!). 


YELLOW. PCA ftwesa'wa (C.L osdwaw, 
O osAwa it is yellow, Sh hoéaaw it zs yellow) ; 
Nt /wisowi/ (weesde yellow). See BITTER. 

YESTERDAY. PCA *weé@a‘kw- (Ha); Nt 
/wenanku/ (wunnonkou, wunnonkoo 7 
was evening, yesterday, last evening). Cp. Me 
oolagod evening, Mi alakiaé yesterday, Nr 
wunnauquit evening. 


3. Examples of the PCA--Natick single 
consonant and semivowel correspondences 
are cited below according to English gloss. 
See Chart II for a comparison of the PCA 
reconstructed phonemes and their F, C, 
M, O and Sh reflexes with the Natick cor- 
respondences.”! 





CHART II 

F C M O  §&h PCA Nt 
n i,2 ‘n n l *9 n 
n y n n l *| n 
n n n n n *n n 
se eS ce J y 
tec t¢¢ tie t¢ t,¢ ie ee 
 -. YF *p p 
k k k k k *k k 
c c c c c *¢ é 
S s s s 6 % 5/8 
S s s § § | 8|§ 
h h h h h,?h *h h 
Ww Ww Ww Ww Ww *w Ww 
m mn om “nm m™ *m m 


*9: n. abandon, again, armpit, between, 
breast, dog, fight, finger, negative, related, 
star, think, tongue, work, yesterday. 


21 Although the PCA reconstructions were made 
on the basis of four Central languages only, i.e. 
F, C, M, and O, the Sh reflexes fit the reconstruc- 
tions; ef. Wick R. Miller, op. cit. 
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y 
#5; 


*1: n. bark, breathe, dig, dish, five, man, 


mixed together, pronominal base, soften) iretch, 
spotted, think. "h: 
*n: n. abandon, arm (see also bone, jau, *w: 


liver, mouth, throat, tongue), bend (see als teak ( 
catch hold of, wncover), breast, by hand, (see {am, 
also lift up), child, daughter, die, fish, gj”: 1 
after, hear, hold, kill, lick, long, lose, mal| th, 
man, meet, one (see also eight, two, three), walk b 
. prefix (first pers.), see, seed, shake] *m: 
snow (n.), south, stone, suckle, sum. fy, love 
sweet, teach, water, who, winter. (see al 
y: y. body, come, four. fear, fi 

*t: t. arm, brain, bring, daughter, day|™"), 
enough, feet, fire, freeze, heart (see also jay, fish, gy 
mouth, related), hold, hot, kick, knee, lick, prefs 
one (see also other), pound, roll, short, siste ated, 
(2), so many, steal, think, tooth. deal, te 

*t, *e: t, &. bad, eat, try, with. (See Sec. § 
*?t, 2c: 't, '¢). There is one example ¢ 
*t: >t (ef. deer); however, the /*/ may repre 
sent an attempt to indicate the quality a 
quantity of the preceding vowel. 

*p: p. along in space or time (see aly 
fly, live, shoot, walk by), bite, break (2) 
bring, burst, closed, come, die, dust, enough, 
enter, five, flea, foam, grease, net, play, pil 
into, roll, root, sit, stretch, summer, tooth 
uncover, water, winter. 

*k: k. abandon, big (2), bite (see als 
catch hold of), bitter, closed, day, dip out/w 
ear, eight, enter, eye, fear, fight, flea, foreheai 
freeze, hot, house, knee, long mixed togethel jy +r 
negative, one, other, plural (an.), poutiireeordiy 
prickly, pron. prefix (second pers.), scraponclys; 
sharp, sister (2), snake, snow (n.), stanihdetermi 
steal, teach, tell news, thick, throat, try, wifor the 
nate, widow, woodchuck. vowel a 

There are three examples of *k: ‘Hmalizati 
Again, this may not be a correspondencpothesis 
but may reflect orthographic convention » In { 
see break (1), taste, tree. 

*ce: &. child, root, top. 

*s: 5. bite (see also catch hold of), eat, f 
go after, shoe, spit, sprout, strong, wid 
yellow. 


shoe, 


mer, 
* 
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e, man, *§: s. break (1), day, eight, hot, south, 
soften sretch, two, urinate. 
*h: h. closed (see also dip out/up). 
ne, jan, *w: w. abandon, bend, between, bitter, body, 
see als} reak (1), dig, dish, eat, egg, eight, excrement, 
nd, (seel nam, four, hot, look, lose, new, prickly, 
fish, gh PO” prefix (third pers.), red, root, short, sit, 
se, malk, south, spit, stand, sweet, taste, top, tree, two, 
», three), walk by, white, who, with, woman, yesterday. 
1, shake *m: m. along in space or time (see also 
le, sum. fy, live, shoot, walk by), bad, bear, bee, bend 
(se also catch hold of, closed, dip out/up, 
jear, find, go after, kick, lick, lift, look, un- 
ter, day cover), big (1), dog, eat, excrement, fight, 
also jau fish, grandmother, grease, hair, move, pron. 
nee, lick prefix (third pers. indeterminate), red, re- 
lated, seed, shoe, sister (1), sleep with, spotted, 
seal, tell news, tree. 


nter. 


ort, siste 


ee Sec. §, 
ample ¢ 
lay repre 
quality ar 


4, In his article on “Algonquian nasaump 
and napopi” (Lg. 21.40-45), Geary points 
out that in the New England languages an 
original Proto-Algonquian nasal cluster 
wually loses its nasal and that those nasal 
dusters which appear are due to an ‘in- 
trusive nasal’ after 6 or au < PA “*a,. 
He suggests that the transcriptions by 
Eliot and others may actually represent a 
nasalized vowel plus consonant. The con- 
snant involved is always a stop (or the 
; affricate ¢). 
ip out/ “\ Examples of PCA *a-T (T: stop), *a-hT 
1, foreheat wrresponding to Nt vowel plus nasal-stop 
ed togetht\ ster are given below. The most frequent 
-), PoUtitrecording of the vowel is (0). There is no 
s.), scraMenclusive evidence that can be used to 
n.), staniidetermine its precise phonetic value, but 
it, try, Witor the time being I have normalized this 
vowel as /a/; ef. English pond.” This nor- 
of *k: 'Wmalization is presented as a working hy- 
»spondenc}pthesis subject to change. No assumptions 


(see als) 
break (2) 
st, enough 
play, pu 
ner, tooth 















(see al 


onvention » In the material I have examined thus far, 
wherever a PCA form contains *o and *o- the Nt 
reording is (00); cf. Eliot’s observation: ‘Es- 
ially we have more frequent use of o and oo 
han other languages have and our oo doth always 
ound as it doth in these English words, moody, 
book.’?. 


of), eat, f 
ong, wid 
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have been made about the length of the 
vowel. 

*a'p: amp. again, dip out/up, look, male, 
rock, stand, strong, view, white. 

*ahp: amp. jaw, play. 

*at: ant. leg, live. 

*a't(*arc): ant (ané); move. 

*a‘k: ank. dish, sprout, throat, yesterday. 

: */onk. bend. 

There are exceptions to this correspond- 
ence (cf. break (1) [Haas questions the PCA 
form], freeze, hide, lick). Because of lack 
of knowledge of the possible conditioning 
factors which may be in operation and 
because of the inconsistencies in transcrip- 
tion, these exceptions cannot be explained. 
However, if we look at Natick and Nar- 
ragansett we see that, insofar as one can 
tell from the orthography, they do not 
always agree in their treatment of the de- 
velopment of the nasal, e.g. Nt wosketomp, 
Nr eneskéetomp man in contrast with Nt 
wunnonk, Nr wunndug dish; Nt muk- 
quttunk, Nr qtttuck throat. 

This ‘intrusive’ nasal may be a secondary 
development in Nt and some other New 
England languages. If it is not, then it is a 
reflex of a PA (as opposed to PCA) feature. 

In order to solve the problem, a systematic 
comparative study of both the Eastern and 
Western languages is needed. It may not be 
entirely out of place to note, e.g. that 
Cheyenne appears to have hT corresponding 
to the ‘intrusive’ nasal + stop, cf. PCA 
*wa'pi, Nt (wompi), Ch vuhpi_ white; 
PCA *wela‘kani, Nt (wunnonk), Ch hi 
tuhkU dish. 


5. The PCA reconstructed consonant 
clusters with their F, C, M, O and Sh 
reflexes and the Natick correspondences 
are presented in Chart III according to the 
first member of the cluster.” 


23 The starred forms to the right of the Natick 
correspondences represent PCEA consonant 
clusters. 
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ee & 
php 
t ht 
k hk 
ec he 
s s 
s Ss 
ht 
n hy 
hp 
ht ht 
hk hk 
he he 
..i- 
BAB 
Ss i 
a 
p sp 
hp sp 
hk sk 
sp 
hk sk 
hp sp 
st 
Sk 8k 
Sk hk 
ee  ¢ 
ht st 
he sc 
g Me 
s st 
s hy 
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CHART III 
M O- Sh PCA 
hp mp p “*mp 
it. nt. 4 *nt 
hk nk k= “*nk 
he ne ec “*ne 
hs ns *ns 
hs nS § “*n8 
hn n *n6 
bn n n nl 
hp pp *hp 
bt tt | hi 
hk kk ?k “*hk 
he ce 7c *he 
hs ss 76 *hs 
hs & 75 *hi 
hn ss 76 *hé 
hn ss 76 *hl 
hp pp *p *Xp 
hp pp *p “6p 
hk kk ?k *6k 
ep pp *ep 
eck &k ?k *ck 
sp Sp Sp “*Sp 

St % 
sk &k &k “*Sk 
hk sk &k “sk 
M O- Sh PCA 
Mi Mh. Pet 
% ec %e *%e 
ae) ee 
2s «8S 2 es 
n ss 6 *76 
"oa OUT 


$ (n?) 


Examples of the PCA—Natick correspond- 
ences are cited below: 


* brain, lift up. 


‘enter (see also put into), hear, 


look, name, throat. 


. sharp. 


armpit, dust, kick, star, sweet. 


. south. 


VOL. XXV 
*ne é .come (proceed) from, _ finger, 
*ns S . view. 
*ns S . eye, nail. 
*nd : ? no examples. 
*nl : n.name. 


There are examples in which the nasi} 


cluster remains in Natick, ef. brain, re 
lated. The conditioning factors for they 
exceptions are unknown. 
*hp : ? ef. jaw, play. 
*ht : t. forehead, make. 
"Me. ear. 
*hk : k. go after, grandmother. 
: bk. deer, freeze, hold, louse, pot, 
root, short, spit. 


*he ? no examples. 

*hs : s. grandmother (see also sister [/)) 
*hS : 4S. so many (?). 

*ho 8. father, walk by (ef. sprout fa 


a possible exception to this correspondence) 

: bs, so many (?). 

*hl 8. breathe. 

*Xp: p. thick. 

*@p (PCEA *xp) : p. back. 
: 4p. arm, sleep with, snow (v). 
(PCEA *6@p) : sp. taste. 

*#k (PCEA *xk) : k. jaw, 


work. 


meet, sho 
> bk. break 
soften, teach. 
(PCEA *6k) : sk. back, bark, bow 
forehead, rock, seed, snake, top, wncove, 
watch. 


(2), land, leg, 


ppl 


Sk . woodchuck. 
*9k (PCEA *6k) : */Sk . bear, find, live 
woman (see also bitch, widow). Cf. sister (/ 
for *Hk (*H : ?) : */8k. 
*ep ? no examples. 
*ck : ? no examples. 
*Sp : Sp. above. 
*sk : sk. eye, new. 
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* rt 
person 
implie 
in N 
found 
forms 

* °t 

*9 Cc 





: Sk. fire, kick, meet, scrape. 
> °/Sk . burst. 

> bk. nail. 

: ? no examples. 

: sk. lick. 

: */8k . red. 


¥* st 
*sk 
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n, finger. 


the nasal 
brain, re. 
for thes 


use, pot, 


sister [7)) 


sprout foi 
pondence) 


(v). 
eet, — sho 
leg, pl 


park, bon: 
),  uncove 


find, live 
f. sister (! 





upe. 
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*t, *?o : bt he. big (2) (see also old 
person, sea, thumb, toe (big). Trumbull 
implies that this alternation is morphological 
in Natick; however, all the examples I 
found of big (2) occurring in combined 
forms point to phonological alternation. 

*?¢ 3 bt. fear, foam, tree, try. 

*2¢ : be. exhaust. 


#95 


S 
: 8/8. big (1). 
8 


~ 


s. bitter. 


: hg, fish, stone. 


s. hair. 
. eight. 


. between, bitch, fear, kill, three. 
. fly, net. 
n(?) . fly, place. 
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NOTES AND REVIEWS 


PEOPLE AND Puiaces. Margaret Mead. 
Illustrated by W. T. Mars and Jan Fair- 
servis; photographs. Cleveland and New 
York: The World Publishing Co., 1959. 


C. F. VorGELIN 


This book is in a series which is explicitly 
written for children (A Rainbow Book), and 
the author’s ‘first book for young people’ 
(p. 319). However, in the instance of Mar- 
garet Mead’s effort, a book is produced 
which is equally useful both for children and 
for adults who are no more sophisticated 
than children in how and why anthropolo- 
gists work in cultures other than their own. 
The first part (Man Learns About Man) 
and the last part (Man Asks About Man) 
generalize so successfully about anthropo- 
logical attitudes that the book as a whole 
may well be the one book to cite in answer 
to the frequent question, what is the one 
book to read about anthropology? (The 
famous text books in general anthropology, 
from Tylor on, are introductory—serve as 
prerequisites for further and narrower inter- 
ests—rather than as unique orientations 
wherewith to start and stop.) But, as would 
be expected, Margaret Mead has seen to it 
that no sensitive reader will stop reading in 
anthropology after reading the middle third 
of her book (How Some Peoples Live): The 
Eskimo (27 pp.), The Indians of the Plains 
(27 pp.), The Ashanti of West Africa (16 
pp.), The Balinese (16 pp.), The Minoans of 
Crete (18 pp.). 

The book ends with a page (319) aBouT 
THE AUTHOR, who ‘has made nine field trips, 
and learned seven different primitive lan- 
guages.’ Many anthropologists have made 
more than nine field trips; I know of no 
other who would admit to having learned 
seven different languages in the course of 
field work, though I do know of two who 
seem capable of such performance—Paul 
Garvin and Kenneth Hale. The talent is 


rare; those who have it should be able to 
teach the rest of us, for talent is at bottom 
successful technique. I quote that part of 
Margaret Mead’s teaching relevant to this 
point in extenso (from the generalizations on 
anthropological attitudes, part I, in the 
third chapter, The Anthropologist at Work, 
but just pp. 84-86 and 88-92): 


‘*’..a@ man watching another man can under. 
stand something about how he feels, and if he 
learns the other man’s language, he can ask him 
questions and listen to his answers. So the study of 
human beings in many parts of the world began 
with men and women, trained to be anthropolo- 
gists, who asked questions. At first they tried 
talking and listening through interpreters, people 
who know something about both languages, but 
this did not work out very well. The interpreters 
themselves often did not understand the questions 
and the answers, and so everyone was likely to get 
mixed up. To understand what really was meant, 
anthropologists had to learn the language the 
people spoke. 

“There are several problems about learning the 
language of a primitive tribe which you might not 
think of right away. The first thing an anthro- 
pologist has to expect is that the new language will 
be quite different from European languages. That 
is, the differences between the new language and 
English will be greater than those between, say, 
English and French and Latin, or English and 
German and Norwegian, or English and Russian, 
because all these languages belong to the same 
family. When we are just beginning to learn the 
language of an unknown primitive people, we do 
not know whether it will be the kind in which 
there are only short words which always stand by 
themselves. We do not know whether the language 
will make the kinds of differences ours does when we 
use she to talk about a girl, and he to talk abouta 
boy and it to talk about a stick. In languages like 
ours we can say: ‘‘He hit her with zt.’’ But in some 
other languages the same sentence would be 
“Third person hit third person with third person.” 
Still other languages make more differences than 
we do, as they have special pronouns not only for 
males and females and things but also for long 
things and round things and hollow things. We 
do not know whether the word for two of some- 
thing will be different from the word for one of that 
thing, and if it is, whether the difference will be 
like that between cat and cats, or mouse and mice, 
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no. 3 


or ox and oxen. We do not know whether the same 
words can be used for men and sheep and eggs or 
whether different words have to be used, such as 
crowd of men, a flock of sheep, a clutch of eggs, a 
pride of lions, a swarm of bees, or a covey of quail. 
We do not even know whether what we call ‘‘parts 
of speech”? such as nouns and pronouns, verbs, 
adjectives and adverbs—will be anything like 
those in English. 

[Citation of an Eskimo song that goes with a 
game cited by Franz Boas = ethnolinguistics]: 
‘From this song you can see that Eskimo is a 
language in which one bit after another is added 
on to change the meaning of a word. 

‘When an anthropologist arrives among the 
primitive people he wants to study, he must find 
someone who can interpret at least a little, so that 
he can ask what people call things and how they 
ask questions and ask other people to do things. 
Probably no one there will have any ideas about 
what a word or a sentence is; nor will anyone know 
that their language has a grammar. They know 
how to speak it and how to correct children when 
they make mistakes. So the anthropologist must 
behave partly like a child who is being taught to 
speak and partly like the first person who made a 
grammar; and he must patiently collect a great 
many sentences: ‘‘I see a man, I see a woman, I see 
a dog; I see it (a woman), I see it (a man), I see it 
(a dog); on and on, until he can both speak the 
language and write down its grammar. 

“.. The anthropologist understands a great 
many things about the savages which they do not 
understand themselves. But at the same time he 
can never feel exactly as they do, of course, and 
even though he speaks the language quite well, it 
is not his own language, his mother tongue in 
which he was sung to sleep when he was a baby... 

“.. Yet this is the way the first anthropolo- 
gists had to work... moving pictures and tape 
recorders had not been invented . . . His success 
in recording the language depended on whether his 
ear was sensitive enough to pick up sounds which 
did not exist in his language .. . 

“., And now we also have the moving picture 
camera and the tape recorder. Of course, it is still 
important for the anthropologist to use the fact 
that he is himself a human being like the people he 
is studying. The tape recorder cannot learn the 
language for him; the moving picture camera can- 
not make friends with anyone. But now for the 
first time he can bring back for other scientists to 
see and hear—and for himself to look at and listen 
to again and again—something that happened 
only once on a faraway island. As the microscope 
made it possible to see things too small for the 
naked eye and the telescope things too far away 
for the naked eye, film and tape make it possible 
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to catch and keep more sights and sounds than the 
brain can take in and sort at one time 

“So, today films ...and tape recordings... 
have become the anthropologists’ substitutes for 
experiments. In experiments, the scientist can do 
the same thing over and over—...In modern 
anthropology we still let history make the experi- 
ments for us, but we record them in such a way 
that they can be studied over and over. As our 
ideas grow and change, we can test the new ones 
out against the old films and the old tape record- 
ings long after the people who were filmed and 
recorded have deserted their old ways and have 
come into the modern world.”’ 


The above strikes me as the simplest and 
briefest statement of the well known Mead 
position on the value of learning the language 
of the culture one is investigating. I agree 
with this in general; and at some points, 
feel competent to add to its definition; but 
at other points fail to understand what is 
meant. 

(1) Interpreters—bilinguals who know the 
language of their own people better than the 
language of the anthropologist—translate 
the responses of monolingual informants to 
the questions put by the anthropologist. 
What subsequently appears in the scientific 
report, then, is a version of the monolingual 
informant’s response, mediated by the trans- 
lation of the interpreter. Such translation 
may be expected to include distortion be- 
cause it is what I would call a ‘reverse 
translation’—reversing the direction of cod- 
ing. Rather than coding with benefit of an 
inventory which one has interiorized, the 
interpreter translates into the anthropolo- 
gist’s language, without the benefit of the 
inventory and categories which he learned 
while becoming a Kwoma (or whatever his 
culture may be). This particular point can 
be less briefly but better stated in terms of 
‘selection’ and ‘the original word game’.! 

(2) The anthropologist who learns to 


1 Roger Brown, Words and Things (pp. xvi, 398. 
Glencoe, Illinois: The Free Press, 1958), reviewed 
by Harry Hoijer in Language 35.496-503 (1959); 
C. F. and F. M. Voegelin, Selection in Hopi Ethics, 
Linguistics, and Translation (in press, Anthro- 
pological Linguistics, 1960). 
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speak a very different language does pre- 
cisely the same thing when he addresses a 
monolingual informant in the latter’s lan- 
guage—he makes a ‘reverse translation’ and 
thereby introduces much the same kind of 
distortions that the interpreter makes when 
he translates from his native language into 
the anthropologist’s. In learning the lan- 
guage of the people one is investigating, 
‘reverse translation’ occurs when one talks 
to the people, but a less distorted kind of 
translation is possible from the utterances of 
the people talking to each other or to the 
anthropologist. 

(3) The latter kind of unidirectional trans- 
lation, which goes inwardly, utilizing in- 
teriorized inventories and categories, is pos- 
sible today withovt learning to speak the 
native’s language—with the use of a tape 
recorder and ancillary informants. That is, 
the original informant’s fast responses are 
taped, and subsequently played back little 
by little, with the original informant’s utter- 
ances repeatedly slowly by the ancillary 
informant; and so transcribed and trans- 
lated by the anthropologist, guided by more 
than one informant. 

(4) The use of the tape recorder for re- 
cording the language of the anthropologist 
who is learning the native’s language may 
give us some cues as to why or how some suc- 
ceed at this enormously difficult task, while 
others (most others) do not. 

INDIANA UNIVERSITY 


On CoNSONANTAL CORRESPONDENCES IN 
THREE ARAWAKAN LANGUAGES 


Dovueias TAYLOR 


Examination of such data on Arawakan 
languages as are accessible to me suggests 
that Island Carib, Lokono and Goajiro 
belong to the same subgroup, which probably 
should also include Paraujano and long ex- 
tinct Taino.! Phonetically, Island Carib of 


1 See IJAL 24.153ff. So far as the present note is 
concerned, it matters little whether such a sub- 
group is or is not a reality. 
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the 17th-century Dominican dialect (ab- 
breviated as IC) appears to have been much 
closer to Guianese Lokono of circa 1800 (L) 
than is modern Goajiro (G); but these last 
two languages have in common a consider- 
ably greater proportion (something in the} 
order of 57% as against 46% for IC : L) of 
basic lexicon. Innovation,? contraction and 
assimilation*® seem to have been far-reaching 
in the recent history of all three languages; 
and for this reason, forms cited below will be 
taken, in the main, from the earliest records 
to which I can refer.‘ ew be 
The several vowel systems are little under- lescripti 
stood (particularly so in the case of Lokono, ligek be 
for which to see or look is shown by different |, 
authors as: adika, adeka, adoka, aduka); liom ar 


ud onl 
be consi 
may be 
Hickers 
1, for t 














) phonolog 


> bibiti > biti four, biama > bian two, iihiikere >|. 
ekura root. Or compare L eke eat and eke container, 
clothes with their IC cognate equivalents, dika 


equivalent, kamfsa, and with its Sp. model. 
4For early IC, I have relied on Raymond 
Breton’s 17th-century Dictionnaire  caraibe- 
frangais and frangais-caraibe in Platzmann’s 
facsimile editions (Leipzig 1892 and 1900), and the 
same author’s Grammaire in the new edition by 
Adam and Leclere (vol. 3 of the BLA, Paris 1878).howels) ¢ 
For L, I have consulted the anonymou hkhir 
Arawakisch-Deutsches Wérterbuch and _ the} , hl 
Grammatik der Arawakischen Sprache in vol. 8 of ' . 
the BLA (which also contains short a 
of Baniva, Bare, Caouiri and Yavitero; Parislouble le 
1882), The Arawak Language of Guiana by C. HWentral 
de Goeje (Amsterdam 1928), which includes the indicate 
most elaborate comparative vocabulary 
Arawakan that we have, and two articles by N. P. honolog 
Hickerson in IJAL 19.181-90 and 20.295-301. Fort lackin 
G, I must depend on N. H. Holmer’s description dfnd in | 
modern Goajiro in IJAL 15.45-57, 110-21, 145-1 ¢ 9. Fy 
and 232-35, having been unable to obtain the 





spelt : 
earlier work of R. Celedon (in vol. 5 of the BLA). a 
Citations from other languages have come from A . va 
18 


de Goeje’s comparative vocabulary (a supplement 
to which is to be found in his long article, Nouvefarib (I 
examen des langues des Antilles, in JSAP 31.1-120}f} ajiro b 
and from various other sources too scattered to bi ; 
ed by 
enumerated here. : 
5 Person 
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und only consonantal correspondences can 
be considered here, although such limitation 
may be held by some to be invalidating. 
Hickerson finds five vowel phonemes, i e a 0 
1, for the modern dialect of Lokono; but 
ays that [i] and [e] on the one hand, [u] 
wd [o] on the other, vary freely in most 
jorms, though they are distinctive in some; 
und that there are instances of morpho- 
phonematic alternation between ‘stable’ /i/ 
and /u/.5 Under these circumstances, it 
night have been wiser to forbear; and had 
there been any prospect of new and better 
‘under. jescriptions of languages belonging to this 
40kONO, Itock being published in the near future, I 
ifferent |hould have done so. As it is, I shall refrain 
aduka); liom any attempt at reconstructing the 
— phonology of a parent tongue common to this 
ologica : 
¢ Island. |ubgroup (which, after all, may never have 
30. existed), and confine myself to seeking, and 
i morn-|rying to explain, the various correspond- 
, biabite laces among—as well as the developments 
wees within—the languages concerned. 
its. dika| One cannot expect to recover with exacti- 
thits 1¢ude the phonological inventories of the 
el. lder dialects; but for the purposes of this 
Raymondhote Island Carib, Lokono and Goajiro 
fest: y be taken to have the following pho- 
, and thefiemes and prosodic features: IC. m b pn nh 
dition bytsk hl riueo (= [é]) a V (nasalization of 
ris 1878).jowels) and V (‘free’ stress), L.m bpndt 
‘kkhlriwd(?)ueoa,G.mpwntcsS&y 
k?hlriitueoa V, together with con- 
1 vol. 8 of ee 
abulariesfnantal and vocalic length, shown here by 
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— “ indicated in citations by CAIC) differs 
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honologically from that of Dominica (IC) 
fn lacking the nh-phoneme (or sequence?), 
ription fad in having four additional phonemes: 
21, 145-5 g. Forms cited from other languages will 


btain thy spelt, as in the sources from which they 
the BLA). 
taken. 


from 
‘pplemen| A list of likely cognates follows; Island 


le, Nouveharib (Dominican dialect), Lokono and 
> 31.1-1Nffoajiro being shown in that order, and pre- 
ed by the English translation. 


5 Personal communication. 
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after -arisi : -adiki : -aciki, arm or wing 
aréna : adena : atina, ashes balisi : balisi : 
pari, bark or skin -tra: -iida: -tah, bat 
(quadruped) biliri : buhiri : posici, bird 
ulibiy : kudibiu : ucii, blood {ta-ho, -ita : 
iitii, -ite-na : -i8a, bone 4bu-hu : abuna-hii : 
himpu, capsicum Ati: hati: hasi, chop (or 
fell) Asuka : asuka : aSota-, claw or nail 
-ibara : -ubada : -upatau, drink Ata : ata : 
asa, ear arikai : adike: ace?e, eat dika: 
eke : ika-, eye Aku : akusi : a?u, flesh ékriko 
(CAIC égorogo) : isiruku : i?iruku (so Hol- 
mer; but de Goeje cites Siruku), fly (vb) 
amghara (CAIC ahdmara) amurda 
awatti-, foot ukiti : ukuti : -?uih, four 
bigburi : biabite > bibiti > biti : pienci, 
from (a sfx) -see : -se : -hee, hand tikabu : 
ukabu : ahappu, he likia : likia : nia, head 
fsik, isdko : isi-hi : -ikii, ekiwi, here idka : 
iaha : yaa, honey mgba : maba : mapa, hus- 
band iraiti: ireti: eeci, juice -ira : -era : 
-ira, leaf, liver -ubana : -pana, maize marisi : 
marisi : mariki > maikki, moon kati : kati : 
kasih, nose isiri : isiri-hi: -ici, one Abana, 
aba : aba- : wanee, path mabu : uaburuku : 
wopuh, say or speak arjaka : adiaka : 
aaSaha, aiSaha, sea balana, baldua : bara : 
palaa, she or it tukuia : tureha : sia (western 
dialect), hia (eastern dialect), sleep aruka 
(CAIC artmuga) : adunka : atyuka, stone 
i-siba-ni (meaning restricted to stone used as 
anchor) : siba : ihpah, tazl fli: ihi : -sih, that- 
masc. (one) lira : lira : cira, that-fem. (one) 
ttira : tura : tira, there idra : iara : caa, they 
nhakia (CAIC hagia) : nakia : naya, this- 
masc. (one) lika : lihi: cii, this-fem. (one) 
tuika : tuhu : tit, thou bukiia : bokia : pia, 
tobacco itili : iuli : yiri > yiih, tongue ininie 
(CAIC jéje-i) : -ie, ti-ie-hi : iyee, ¢ooth ari : 
ari : ari > aai, two biama : biama > bian : 
piama, we uakia : uakia : waya, with -Gima, : 
-uma : -umaa, woman jaru > hjaru (CAIC 
hij4ru) : hiaru : hiieer, ye huktia : hukia : 
hia. 

This list may be supplemented by others 
showing some likely cognates found in only 
two of these languages; so: 

IC : L. agouti pikuli (CAIC figuri) : 
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pukuleru > hokorero, ask (inquire or require 
of) Alakota : ahadakuta > adakota, boil 
(tumour) sibili : subuli, chew asdkura : asa- 
kuda, cut fbika: ibika, dig asika : atika, 
face isibu : isibu, fat (grease) -akéle, Akolie : 
ikihi, flame eléleni : eheludun, give isika : 
isika, asika, grass kalau : karau, hammock 
ékora (CAIC -égora) ukura-hu, hear 
akgba : akanaba, fill Apara: apara, leg 
-tirna : iidana, name fri: iri-hi, pot riara : 
duada, root ilakra : tihiikere > ekura, 
scrap(s) -ipe : -ipe, see arika : adika, slap 
apdtara : apatada, slice (vb) elésura : 
eheiisa, stink insi- > h{(n)si- : hisi-, worm 
(maggot) -ise : ise-hi. 

IC: G. flower fleue: -sii, *siwi, heart 
4nisi : ani > aij, lip iimaru : iimata (in IC, 
a compound: mouth-border), night aridbu : 
ali > aih (cf. IC ardbu forest, containing ar- 
tree, L uniabu water, containing uini river, 
rain, etc.), sun kasi: kaih, sweep (with 
broom) abuita : epitta, what kata : kasa. 

L: G. all -makua : pufua, az baru : pore, 
come anda : ent-, die ahuda : au(k)taa (the 
k is present in Holmer’s record but not in 
that of Celedon), father -ti : -Sih, fire hikihi : 
sikih (for Trinidadian L, Dudley recorded 
hiket in 1594-95), fish hime: hime, flow 
amalada : aparala, forest (L) kunuku : tree 
(G) unu?u, hair ubara : -wala, walas, horn 
ukoa : -uuwa, J dakia : taya (as against IC 
nuktia), know adita > aita : attha (ef. IC 
arita think), mother -iu : -eyu, neck unuru : 
anuulu, potato (sweet) haliti: haisi, sand 
mutuku : muaku, seed isi-hi : -si, three kabu- 
hin > kabun: apuni, walk or go akuna: 
a?una, water (substance) uniabu, (occurrence 
in nature) uini, uin : wil, wife iretu : eerij. 

Short as are these lists (54 items in three 
languages and 53 in two), they could not, I 
think, be usefully extended from the data 
at my disposal;—by which I mean that such 
additions as I could make would only mul- 
tiply instances of adequately attested cor- 
respondences without clarifying those that 
are obscure (as, for example, are the Goajiro 
reflexes of p, t and s in Island Carib and 
Lokono). 
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In the paragraphs that follow it has, | 
hope, been possible to explain some appar. 
ent irregularities (e.g., m : u : win the words 
for path) by different developments internal 
to the languages concerned; to accept other 
provisionally, on the strength of the evi- 
dence, while awaiting an explanation (eg. 
1 : h : s in the words for bat, tail, etc.); and 
to reject a few (e.g., apparent s : k in the 
IC and L words for after and in the L and 
G words for head) as incompatible with 
other and weightier evidence. 

IC m : Lm: G m (see under honey, 
maize, two, with and, for IC : G alone, lip, for 
L: G, fish, sand. Apparently irregular in this 
respect are the forms meaning bone, fly, path 
and, for L : G alone, all, flow; but an ex. 
planation can be given for at least two o 
these items. 

G himpu bone, when compared with such 
other Arawakan equivalents as Mehinaqw 
and Waura inapii, is seen to result from syn- 
cope of the vowel, a, and assimilation of the 
nasal to the labial articulation of the follow. 
ing stop. This nasal—or the morpheme con- 
taining it—has been lost in the IC and | 
equivalents, unless the latter should be a 
case of inversion (L abuna- < *nabu). 

Also the initial phonemes of the words fa 
path almost certainly result from assimilation 
(IC m&bu < *ndbu, L uabu- < *iabu- and 
G wopu- < *yopu-), preceded, in the cas 
of the last two languages, by palatalizatio 
and loss of *n (L *iabu- and G *yopu- from 
earlier *inabu or *inapu); cf. Bare dinabu, 
Maipure anepu and Piapoco ayapu, al 
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meaning path. Similarly, compare IC mii Goajiro 


with L nibi > mibi, both meaning bush-rops, 
liana, and IC mabi sweet-potato from earlie 
n&bi (‘‘a este age llaman los de Caribi nabi’ 
Chanca and Columbus, Guadeloupe), itsel 
borrowed from Karina napi, which has the 
same meaning. For palatalization and los 
of *n after immediately preceding i, sé 
under n : n : n below. 

As regards the apparent correspondence 
of IC and L m to G w in the words for fly 
and of L m to G p in those for all and flow, 
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Jcan only suggest that this may be the out- 
ome of dialect mixture within Goajiro, 
which is said to be common. Thus, Holmer 
cites from the speech of different informants: 
niicka = piicka, waima = maima. 

IC b: Lb: Gp (see under ashes, bat, bone, 
daw, four, hand, honey, leaf, path, sea, stone, 
thou, two and, for IC : L alone, boil, cut, face, 
hear, for IC : G, sweep, for L : G, az, three). 
Seemingly irregular in this respect are: 

G ucii (IC ulibiu, L kudibiu) bird and 
wala (L -ubara) hair, in which the labial 
stop appears to have been lost without com- 
pensation, and G wanee (IC abana, L aba-) 
me. So far as the last two instances are con- 
cerned, G p may possibly have been re- 
placed by w through non-distinctive voicing 
of the former phoneme, and its subsequent 
confusion with the w that replaces Sp. b in 
some loanwords like wooliiwa bolivar, kaawa 
cabo and tawaak tabaco. 

G pienci (IC bigburi, L biabiti) four (all 
containing, as first element, morphs mean- 
ing two, q.v.). Here we seem to have two 
assimilations due to vowel syncope: medial 
*» of expected *piem(V)p(V)si being re- 
placed by t, and its *m by n, as in pientéi or, 
since ¢ is a phoneme, pienci.® 

IC p (> CAIC f) : Lp (> f) : Gh(?). So 
faras IC : L is concerned, see under agouti, 
kill, serap(s) and slap. But in neither lan- 
guage is this phoneme found in a large num- 
ber of morphemes; and unless G alihuna 
stranger should be cognate with IC kalipuna 
and L kalipina, both meaning Carib, no 
likely correspondence for it can be shown for 
Goajiro. However, cf. G er(#) (L peru) dog, 
which was probably borrowed, like its L 
equivalent, from Spanish (rather than being 
cognate with IC Ayli, Piapoco auri and other 
similar words for dog, which were inherited). 

ICn: Ln: Gn (see under arm, leaf and, 


and los| for IC : L alone, leg, for L : G, come, forest, 


ng i, se 


neck, they, three, walk). Other instances are 


6 As will be seen below, G c could here come 


ondence from an earlier liquid (1 or r); but in that case one 


ds for fly, 
and flow, 


must conclude that the second labial stop was 
quite lost. 


NOTES AND REVIEWS 


247 


not hard to find; so, for IC : L, dunaba : 
aonaba answer, anéka : anika take or carry, 
antiata : anoa choose, andko : anakii high sea, 
or middle (of a place), -Ani : -ani own (prop- 
erty), néhene : nahali- paddle (n.). (I take it 
that final -ne and -li of the last example 
once belonged to separate and non-cognate 
morphemes; cf. Taino nahe, Bare neheu, 
both meaning paddle.) 

As in the cases of G himpu bone and pienci 
four (see above), vowel syncope brings 
about assimilation of a nasal, or its reduction 
to vowel nasalization (see examples under 
sleep and IC : L hear). 

In all three languages, n tends to be 
palatalized, reduced to vowel nasalization, 
or lost where i precedes (for examples, see 
under tongue, woman and IC : L stink; and 
compare with inene tongue and inaru woman 
of Carutana and other languages). In Lo- 
kono, all apical consonants may be pala- 
talized when followed by i (as in -ani, hati, 
etc.), without thereby losing their phono- 
logical identity; while in Goajiro, some at 
least undergo regular phonological change 
in this position, as in wij < *wini (L uni-) 
water, aih < ali night, ytih < yiri (L iuli) 
tobacco, -?uih < *-?uSi (L ukuti) foot. 

IC nh (>CAIC h) : Ln: Gn, as in: 
nhakia (CAIC hagia) : nakia : naya they. 

ICr: Ld: Gt, c (see under after, arm, 
bark, claw, ear, sleep and, for IC : L alone, 
chew, fly, leg, pot, see, slap, for IC : G, lip, for 
L : G, come, die, I, know). On the basis of 
comparison with other languages of the 
stock, it seems justified to group these two 
sets together, and to conclude that the G 
opposition of ¢ to t results from internal 
and fairly recent developments. 

The forms meaning bird and say should 
probably be included here, although they 
show, respectively,1:d:candr:d:§&. 
Interchange between | and r will be discussed 
below. Transposition or absorption of the i 
in the forms meaning say would account for 
G c rather than t; while loss of occlusion, 
resulting in §, may well be due to the conse- 
quent opening of the syllable. 
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This correspondence should be compared 
with that of | : h : s, for which see below. 

ICt: Lt: Gt, as in tira : tura : tira 
that-fem. (one), tika : tuhu : tit this-fem. 
(one) and (IC : G) abuita : epitta to sweep. 
No other instance of this correspondence 
was found in Goajiro. 

ICt: Lt: Gs, & (see under blood, capsi- 
cum, drink, moon, she and, for IC : L alone, 
ask, for IC : G, what, for L : G, father, 
(sweet) potato. The same remarks as were 
made above with regard to G t and ¢ apply 
to the grouping together of Gs and &. 

Anomalous and unexplained are: apparent 
IC s: Lt in asika : atika dig, for which I 
can find no likely cognate in Goajiro, and 
IC r : L t in bigburi : biabiti four, whose 
Goajiro equivalent, pienci, has already been 
discussed under b : b : p above. 

What looks like another instance of L 
(and in this case also IC) t : Ge may be 
seen in IC iraiti : L ireti : G eeci husband; 
though here again I think that G c might be 
resolved into t + §;sincel andr of this lan- 
guage are regularly replaced by its t before 
another consonant (so, hiieer woman + 
-kal > hiieetkal, and er(i) dog + -ka > 
etka); and eer- (by metathesis from ire-) + 
-$i would give eetsi, or eeci. (Alternatively, 
G eeci husband may be unrelated to its IC 
and L equivalents; and another possible 
ancestry for this form will be suggested 
under r : r : c below.) 

Expected s or § of Goajiro may be lost or 
reduced to h in some forms; cf. G muaku 
(L mutuku) sand and G atiha (L adita) 
know. But then it should be borne in mind 
that, dialectally, eastern G h and western 
G s or § replace one another in some words 
at least; so: hia or sia she, it, hV- or sV- she 
or her, it or tts. 

ICs: Ls: Gs, §. See under chop and, for 
L: G alone, seed; though in neither case is 
the Goajiro form indubitably cognate with 
the others. On the other hand, IC s : Ls is 
well attested (as in the forms meaning ashes, 
boil, chew, face, from, give, head, maize, nose, 
stink, stone). 
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ICs: Ls: Gh. See under from, stone | 
and, for IC : G alone, sun. The first, a| 
suffix, occurs as in IC iniysee from on high | 
(iniy high), L hururuse (out) from the ground | 
(hururu ground), G wattahee from afar 
(wattaa- far); and contrasts with the nomi- 
nal dependent suffix (IC -te : L -te : G -see), 
occurring as in IC nitlite : L daiulite : G 
tayii(h)see my tobacco. The other two exam- 
ples present rather a special problem (and 
that whether the consonant cluster of G | 
ihpah stone results from vowel syncope or 
from metathesis); for Holmer says that 
syllable-final -(i)h of Goajiro often repre- 
sents “original” -li, -ri or -ci, as in yiih < 
yiri tobacco. It would seem, however, that it 
might also represent earlier -si or -Si, as in 
the Goajiro forms meaning foot, stone, sun; 
(cf. also the L and G words for fire). 

Such apparent correspondences to L s as 
G c (see under nose), G k (see under head, 
maize), 1C k and G ? (see under flesh) must 
be rejected. As I have tried to show below, 
c of the Goajiro word for nose mest probably 
developed out of an earlier liquid (1 or r), and 
corresponds to r of the IC and L equivalents, 
The L and G forms meaning head obviously 
cannot be cognate if, as I believe to be the 
case, L isi-hi head, tip, top is identical 
(rather than homonymous) with L isi-hi 
seed, and if G -st seed is a cognate of this 
form. As I have previously suggested’ the 
IC forms meaning head may have contained 
a morph belonging to IC éko core, kernel 
etc., and their G equivalents a cognate! 
thereof, together with a cognate of L iti’ 
fruit; and a similar analysis seems at. least! 
possible in the case of the words meaning 
flesh and maize. And if this should be so, then 
all these forms, except L isi- head, etc., wert) 
compounds or at least complex; the IC and 
L words for maize being fully cognate but 
only partly so with their G equivalent; the 
IC and G words for flesh being fully cognate 
but only partly so with their L equivalent; 
while L isi- head is cognate with the firs! 
part of its IC equivalent, whose secont 


7See IJAL 24.78. 
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' member is cognate with the first part of G 


ekiwi head; the non-cognate morphs con- 
taining respectively s and k having probably 
been Synonymous or nearly so This inter- 
pretation finds some support in the fact G 
apparently has or had two words meaning 
flesh: i?truku and Siruku. 

ICk: Lk: Gk, ?. See under ear, eat, eye, 
flesh, foot, moon, sleep and, for IC : L alone, 
agouti, ask, chew, cut, dig, for IC : G, sun, 
what, for L:G, fire, forest, sand, walk. 
There is no indication that the glottal catch 
of G developed out of a positional variant of 
the dorsal stop; thus, compare L -luku : G 
-u?u in with L -ruku : G -ruku in the words 
meaning flesh, or the glottal catch in the G 
forms for eye and walk as against the dorsal 
stop in the G word for sand, which has the 
same environment. Nevertheless I do not 
believe, as does Holmer, that this fact neces- 
sarily indicates “‘the earlier existence of 
two k-sounds in primitive Arawak.” As I 
have tried to show elsewhere, k of early IC 
split into aspirated K and unaspirated g of 
the same island’s more recent dialect, devel- 
oping into h, voiced g and voiceless k (a rare 
phoneme) of CAIC; so that there is now no 
indication of why Breton’s k (as in his 
“narikié je regarde” and ‘‘natikié je péche’’) 
should have given CAIC h in one instance 
(narihia), but g in another (nadigia), or of 
why his “etiké noyau, & tout autre chose qui 
a une enveloppe”’ should have split into 
CAIC -ého extract, essence and -dgo nucleus, 
vulva.’ 

ICk: Lk: Gh (see under hand, say). 

ICk: Lk: GQ (zero). The glottal catch 
of Goajiro “is not found initially or before a 
consonant and is weakened in word-final 
position” (Holmer). This would explain the 
absence of an initial consonant in the G 
forms meaning bird, three and tree (cf. also 
IC ulibiy bird; though CAIC guribiua, the 
name of a species of bird, would seem to be 
its cognate). Elsewhere, loss may be due to 
palatalization, as in the G forms meaning 


§See IJAL 24.238, and especially IJAL 26, 
loc. cit. in footnote 2 above. 
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he, I, they, thou, we, ye (whose IC and L 
equivalents have a morpheme -kia or -kuia 
in common), or, it would seem, to labializa- 
tion, as in the G word for horn. 

IC k: Lh: G Q (zero). See under here, 
this-fem. (one), this-masc. (one). An analogous 
reduction has affected these same forms’ 
cognates in the other two languages; cf. re- 
cent IC idha, ttha, liha; CAIC id, toa 
[to:], lea [le:]; modern L ia, li, to. 

As in the case of the apparently ‘irregular’ 
correspondences to Lokono /s/ discussed 
above, I must reject the apparent corre- 
spondences of IC s to Lk and G k, as in the 
forms meaning after, and of G § ors to Lk, 
as in those meaning all and in L akiileka : 
asiliha extend towards, offer, give (cf. IC 
asilera arrive); and conclude that the mor- 
phemes concerned are not cognate, though 
the rest of the forms into which they enter 
may well be so. 

IC h: Lh:Gh (see under woman, ye 
and, for L: G alone, fish and sweet potato). 
Rather expectedly, this phoneme seems to 
be unstable in all three languages. Recent 
IC and CAIC both have h corresponding to 
L h as in: haba: haba (kind of) basket, 
haburi- : haburi- ashamed, hala : hala seat 
(n.), hinsi- or hjsi- : hisi- stink, hiaru (IC) 
and hja4ru (CAIC) : hiaru woman; though 
all of these have vowel initial in Breton’s 
transcription of early IC. This may not have 
been due to negligence on Breton’s part; 
for he discusses the aspirate, and shows it 
both initially and intervocalically in some 
other forms where h is today organic (e.g.: 
huktiia > CAIC hugtia ye, hama- > CAIC 
hama- fly, whence amghara > CAIC aha- 
mara to fly); while his ati > CAIC Ati (as 
against L hati and G ha&si) capsicum both 
lack it, as does CAIC 4 (sa)... ? what (or 
how) about... ?—supposing this form to be 
cognate with G ha < hama, which has the 
same sense. 

IC 1: Lh: Gs (see under bat, tail and, 
for IC : L alone, ask, fat, flame, root, slice, 
for IC : G, flower, for L: G, fire. Though 
some of the instances cited may be suspect 
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(thus | of the IC form for ask might corre- 
spond to either the h or the d of its L equiva- 
lent), this correspondence—whatever its an- 
cestry’—seems adequately attested by the 


9 Although there is some indication that this 
correspondence may be confined to positions 
before what was a high or a mid front vowel, its 
occurrence is not to be attributed (as is that of 
G ci corresponding to li or ri of IC and L; see 
below) to factors pertaining to the internal de- 
velopment of this or that language. Nor can it be 
ascribed to regular sound change operating in two 
of these languages on a phoneme that remained 
unaltered in the third. The PARENT phoneme no 
longer exists; but clues as to its nature, though 
slight, are not altogether lacking. 

In the years 1594-95 on the island of Trinidad, 
the Englishman Robert Dudley recorded as ending 
in ‘th’ and in “t’’ several Lokono words 
(‘‘addoth’’, ‘‘dasereth’’, “‘sibath’’, ‘‘tauraroth’’, 
“callit’’, ‘ureit’’) which lack these endings in all 
subsequent records (whose transcriptions indicate 
forms we write: ada, dasere, siba, kaiuru, kali, 
iuri or iuli). And some fifty years later on the 
island of Dominica, the Frenchman Raymond 
Breton recorded with final ‘‘t’’ several Island 
Carib words (‘‘arfbelet’’, ‘“‘bichet’’, ‘‘bourrélet’’, 
‘“hébechet”’ or ‘‘fbichet’’) whose modern descend- 
ants and (or) cognate equivalents in closely re- 
lated languages lack any reflex of this ending; so: 
CAIC alibele and L idibale, Taino bixa (as against 
Machiguenga puSote, a¢iotl of unrelated Nahuatl), 
Taino burén and L budali, CAIC hfbise and Taino 
hibiz. There is nothing very remarkable about the 
weakening and loss of consonants in final or in 
another implosive position; but in view of Breton’s 
description of an Island Carib speech sound as an 
apico-interdental lateral (v. infra in text for his 
own words), preceded, in the speech of a minority 
only, by an apical occlusion, Dudley’s “th” of 
Trinidadian Lokono is of interest. 

It suggests the earlier existence of a phoneme not 
unlike the tl of Nahuatl, which was eliminated 
from word-final position in at least two of the 
languages, but was merged in some other positions 
with IC 1 (through spirantization and loss of the 
first element), L h (through aspiration or spiranti- 
zation of the first element and loss of the second) 
and G s (through assibilation of the first element 
and loss of the second). 

It is tempting to seek another instance of this 
correspondence in the markers of the nominal 
absolute, L -hV (vocalic harmony) and G -8, as in 
L abunahii bone vs dabuna my bone, G walas hair 
vs tawala my hair,—especially as the Nahuatl 
suffix -\(i) has the same function! But that is im- 
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remainder. So, some other Arawakan word} 
for bat are: pisiri (Yucuna), pitseri (Cats 
politani), pitiri (Carutana & Siusi), Siepir 





probable; for IC has -ho ~ -hi ~ -hu (not *-lV 
with the same function as the L morphs; only G; 
(but not §) is found in other instances of this corr. 
spondence; the L and G absolutive suffixes do no 
both occur with cognate stems (thus: L uban 
hair, G himpu bone). 

On the other hand, comparison with cognate’ 
from outside this group (e.g., Baure huitere bai 
suggests that only where the ‘parent’ phonen 
was between high or mid front vowels—or, whe 
word-initial, was followed by such a vowel—dit 
it yield IC1: Lh: Gs; and that in other environ 
ments it gave IC r: Ld: Gt, c. Thus Carutam 
for example, has the same apical stop in pitir 
(IC bdliri : L buhiri : G posici) bat or itipi (Il 
fli: Lihi : G -sih) tail as in tawana (IC -trna :] 
-iidana) leg, and Manao has the same apical sto 
in ghiigaty (L hikihi : G sikih) fire as in ata (| 
ada) tree, -nyupdta (IC -tbara : L -ubada : ( 
-upatau) claw or -teki (IC arfkai : L adike : ( 
ace?e) ear. If this should be correct, it would seen 
that the IC, L and G words for bat have lost; 
front vowel, or replaced it by a back vowel, an/ 
that IC 1 in the word for ask corresponds not ti 
the h but to the d of its L cognate equivalent (« 
in the case of the words meaning bird). Moreover 
though Palicur -ribu éail and tiketi fire can hardl 
both be cognate with their IC, L and G equivalent 
(ep. also Palicur -riui fruit with the IC and ( 
forms meaning flower), the former language dos 
seem to have undergone a change analogous to thi! 
which presumably produced our two position 
sets (1: h: sandr: d: t,c); forits h regularly 
corresponds to Ld (IC r, Gr, ¢c), as in aha tre 
kuhipra b¢rd and himaka sleep. But here greii 
caution is needed, particularly in the case ¢ 
Palicur t, which apparently stems from more tha 
one pheneme of the parent language, and corr 
sponds to L t (as in atiti : hati capsicum), Ls (i 
in tipa : siba stone; cf. Carutana hipa and Mana 
ghiia stone), and L h (as in tiketi : hikihi fir 
or zero (as in pitana : biama two; ef. Piapoei 
putsaipa, Carutana ndzame, Ariti hinama ant 
Taino yamoka, which have the same meaning). 
Also IC 1 may correspond to zero of Lokono, asi 
liki- or like- or leko- : L -iki-n- only, alone; with 
which compare Taino heketi one. In this case i! 
least, the absence of an h in the Lokono fom 
may be explained by the fact that it does no 
apparently, occur without a pronominal prefit 
whereas with that of Island Carib, which is itsel 
often enclitic, only a suffix (pronominal or other 
is found. 
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(Ipurina); and for tail: ihibi (Bare), isipi 
(Ipurina), itsipi (Catapoliyani), itipi (Caru- 
tana & Siusi), -ribu (Palicur), enihu (Ariti, 
which lacks a labial stop); while from L ihi 
and IC ili tail are derived the verbs: L ihika, 
which the Herrnhuter author of the Ara- 
wakisch-Deutsches Wérterbuch modestly 
translates “‘concumbere con muliere’’, and IC 
alidkua, which has the same meaning. In IC 
flakra : L tihiikere root, IC 1 can correspond 
to nothing else but h of the Lokono (lost in 
modern ekura); and cf. Wapisiana luakare 
firewood, a probable cognate. No absolute 
form of G -sii flower is given by Holmer; but 
on the analogy of G -ikii, ekiwi head, and of 
IC uira : Liuida : G iita calabash, one might 
expect it to be or have been “*siwi, corre- 
sponding to IC ileue, Adzaneni eliwi and 
Bare hiwi, meaning flower, Palicur -riui and 
L iui fruit. L hikihi and G sikih (from earlier 
*sikisi?) fire have such other likely cognates 
as: tidze (Carutana, Siusi & Catapolitani), 
tiketi (Palicur), tikiahi (Saraveca), ilika 
(Ariti), all meaning fire, and Guana tikoti 
tree while IC uelékia fire-drill—with which 
compare Breton’s IC ‘“‘liki-likiti ou riki- 
rikiti, branlé, ébranlé”’—and (or) ileme blaze, 
kindling are at least possible kin. 
IC1:L1,r: Gr, 1 (see under ashes, sea, 
tobacco and, for IC : L alone, agouti, boil, 
flame, grass, for L:G, flow, hair, neck). 
The modern dialects of all three languages 
are said to have two liquids, | and r; and this 
opposition was already present, so far as I 
can tell, in early IC (cf. ili tazl but iri name, 
tila its contents but tira its juice). The 
Herrnhuter Grammatik der Arawakischen 
Sprache even claims (but does not write) a 
third liquid “‘zwischen / und r, den man sehr 
schwer nachsprechen kann”; and Breton 
may have meant to indicate something 
similar for IC by his double “‘ll”” (employed 
inconsistently, and not distinguished here 
from simple 1, since apparently non-distinc- 
tive), or by his “dl” in “dleuti” (and only 
one other morpheme), on which he com- 
ments: ‘la plus part disent lewti, mais cét | 
se forme par le bout de la langue engagée 
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entre les dents.’’ Nevertheless, | and r are 
given as interchangeable in some forms (cf. 
that cited from IC at the end of the preced- 
ing paragraph, and L “dkkulu, ikkuru, das 
Knie”, “wurrdru, wulldélu Wolken,” Nebel’); 
and minimal pairs showing this opposition 
are extremely few (for Lokono the best I can 
find are: “kaélli Cassabi” vs “k4ri-hi Krank- 
heit, Schmerz”, “ulliku in” vs “trruku 
Nymphae”’; which are valid only if we sup- 
pose stress and gemination or length to have 
been non-distinctive, as appears to be true 
of the modern dialect). 

On the other hand we find, as between 
earlier and later dialects both of Island Carib 
and of Lokono, besides 1 > | and r > r, 
1 > r (perhaps especially, but neither only 
nor always, Breton’s “Il” > r), but only 
very exceptionally r > 1 (IC aribelet > 
CAIC alibele roasted meat is my only exam- 
ple). So, for instance, IC itli : L iuli tobacco 
and IC balana or baléua : L bara sea have 
become, respectively, recent IC and CAIC 
iuri : modern L iuri, recent IC and CAIC 
bardna or bar4ua : modern L bara. 

IC 1:L1:Gn. This correspondence— 
if it be one—was found only in the words 
meaning he, and in the prefixes meaning he, 
his. I can offer no explanation. 

IC r:Lr:Gr (see under flesh, juice, 
maize, that-fem., that-masc., tooth, wife, 
woman and, for IC: L alone, hammock, 
kill, name, root, for L : G, az). 

The development of G mariki > maikki 
maize may perhaps be explained as the 
product of metathesis (mari- > *mair-), 
morphophonematic alternation (*mair- + 
-ki) *maitki) and assimilation (*maitki > 
maikki), in the same way as we have sought 
to explain the Goajiro word for husband. 

G eerti wife (cf. L iretu) may likewise re- 
sult from ire- > eer- + *-ni > -i; and if so, 
it is only partly cognate with its Lokono 
equivalent. 

An | or anr may be lost in Goajiro, as in L 
haliti: G haisi sweet-potato, IC idra: L 


10 Cf. IC alfru white cloud, ual{u black cloud. 
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iara : G caa there, G ari > aai tooth, this 
latter with compensatory lengthening. 

IC li, ri: L li, ri: G ci (see under bat, 
nose, that-masc., this-masc.). The Goajiro 
reflex seen here is an OCCASIONAL correspond- 
ence due to various factors such as weaken- 
ing, dissimilation and analogy. That it is 
not a regular sound shift appears from the 
occurrence of such a form as G pari (IC 
balisi, L balisi) ashes. 

In such Goajiro forms as show it, an 
earlier liquid must have become palatalized 
and fallen together with y (cf., in closely 
related Paraujano, yi nose and eiy(e) ~ 
eitsch man; the latter apparently cognate 
with IC eiéri man). This y was then 
STRENGTHENED to [d%]—a sound which one 
of Holmer’s informants, he tells us, em- 
ployed “for initial y in yiih tobacco and in 
some similar words.”’ However, in a lan- 
guage whose phonology includes a hushing 
palatal affricate, c, but no voiced—voiceless 
opposition, this sound could hardly escape 
identification with the affricate; and this 
has clearly taken place in the case of G caa 
(IC idra, L iara) there, which otherwise 
would have been indistinguishable from G 
yaa (IC idka, L iaha) here. There is there- 
fore no reason to suppose that the ¢ of G 
cira (IC lira, L lira) that-masc. (one), G cii 
(IC lika, L lihi) this-masc. (one) and G posici 
(IC biliri, L buhiri) bat had a different origin. 

Some doubt might remain as to whether 
the c of G -ici nose corresponds to the s or to 
the r of IC isiri, L isiri-; but no other indica- 
tion of the former correspondence has been 
found; Paraujano yi nose points to *iyi as the 
immediate parent of the Goajiro form; and, 
while Wapisiana iribe and Paumari iridi, 
both meaning nose, lack any reflex of s in 
the IC and L equivalents, no likely cognate 
that lacks a reflex of these same words’ r 
has been found. 

The Paraujano forms meaning man as 
cited above are sufficiently similar to G eeci 
husband to throw doubt on the latter’s 
ancestry. In this form cognate with IC 
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irditi and L ireti husband, as we have sug- 
gested, or with IC eiéri (Palicur hifeiri, 
Piapoco asieri) man? If with the former, one 
might expect the hypothesized metathesis 
(ire- > eer-) to remain stable in this mor- 
pheme, whose cognates are also found in L 
ire-nu children and IC (i)ra-ho child, little 
(one). But we have G ahapp-ira finger (hand- 
child) corresponding to IC -tikabu-r4-hi with 
the same meaning, where this transposition 
of the r has not occurred. It may therefore be 
that G eeci husband is cognate with the IC 
and other forms meaning man; and that G 
eerij wife contains a contraction (or one 
morph?) of G hiieer woman (cf. Palicur ero 
woman, Manao ero wife) together with 
some other morpheme (cf. IC and L -ani 
belonging, Maraua runo woman). 
Dominica, British West INpIEs 


COMPOUNDS AND COMPARISON AGAIN 
Dovueias TAYLOR 


As I tried to show in a recent note under 
the same heading (IJAL 24.77ff.), compari- 
son of not too distantly related Arawakan 
languages often reveals the presence in one 
of PETRIFIED former morphemes (both roots 
and affixes) that are still ‘alive’-—can be 
isolated, that is to say—in another.! It 
may then appear that some very similar 
equivalents in two languages, the one a 
compound and the other synchronically 
complex or even a simple form, are no more 


1So, Lokono -itima beard or moustache appears 
to be unanalysable in that language (despite its 
uima mouth of a river); whereas its Island-Carib 
equivalent, -{tiuma, is readily resolved into -iuma 
mouth and iti- (ef. ftiburi hair, and bi4buri four 
from bfama two) hair, strand, piece. Similarly, 
Island-Carib irditi husband (despite irého child of 
the same language) is not susceptible of analysis; 
whereas its Lokono equivalent, ireti, is readily 
resolved into ire- (as in irenu children with the 
nominal pluralizer -nu, ireiu or iretu wife) child 
and -ti father (as in dati my father; cf. daiu my 
mother). 
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than partly cognate.? In what follows, I 
call the reader’s attention to a few further 
possible instances of both sorts, and try to 
amend what now appear to have been a 
couple of faulty interpretations on my part. 

There can be no doubt that Lokono 
‘“diussi die Warze an der Brust” and “jahissi 
Kattun-Saamen” are compounds of the 
same language’s “idiju die Brust” and “jaéhu 
lattun”’ with “issihi der Kopf, das Ausserste, 
ler Gipfel: dassi, bissi, lissi, tissi, wassi, 
hissi, nassi [my, thy, his, her, our, your, their 
head, etc.]; heisst auch: Saamen’’; and that 
irespective of whether the latter entry 
tites two homophonous morphemes, or 
oly one as the author seems to imply.’ 
And it might be justified to recognize its 
“wi die Frucht an allerlei Baéumen’’, to- 
gether with -si seed, in “iwissi cuniculi, die 
Hoden; auch wohl: penis; diwissi, liwissi 
oder désse, lésse; fwissi hikkudu die Hoden; 
iwitikittin fruchtbar machen;...’’. But then 


* For example, in view of such correspondences 
a8 are apparent in Goajiro iita, Island-Carib uira 
and Lokono iuida, all meaning calabash, syn- 
chronically unanalysable Goajiro iimat(a) lip, 
whose parenthetic vowel is often omitted, might 
sem to be fully cognate with its Island-Carib 
equivalent, iidmaru, a compound of iuma- mouth 
with -aru border or edge. But this compound’s 
second member is comparable with Lokono aruma 
side, edge, coat; and while Island-Carib /r/ corre- 
sponds to either /d/ or /r/ of Lokono, Goajiro /r/ 
corresponds to /r/ of Lokono, Goajiro /t/ to /d/ 
of Lokono. It is therefore clear that the Goajiro 
word for lip never contained a coguate of Island- 
Carib dru border, but, instead, a morph belonging 
to its own -tah (Island-Carib -tira, Lokono -iida) 
skin. Analogously Lokono, from its -leruku mouth, 
has formed -lerukuda lip (cf. also French Creole 
lapo buS lip or lips, but literally skin [of 
the] mouth). 

§ Citations of Lokono are from the Arawakisch- 
Deutsches Worterbuch or from the Grammatik 
der Arawakischen Sprache, both anonymous and 
both contained in volume VIII of the Bibliothéque 
linguistique Américaine; Paris, 1882. If Schultz 
and Schumann were the respective authors of 
these works, as is generally believed, they were 
most probably compiled during the last quarter of 
the eighteenth century. 
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we also find: “‘issinihi penis: dassin, bissin, 
lissin; tissina id quod est in nymphis;... 
fnissi cuniculi: désse, bésse, lésse; elonti 
fnissi” little boy’s testicles; from which it 
appears probable that inisi contains -si seed 
in common with its quasi-synonym iwisi; 
and at least possible that inisi and isini- 
(-hi is a suffix marking the absolute noun) 
contain or contained the same two mor- 
phemes differently ordered. But if so, ini- 
and -ni must belong to a morpheme other 
than iwi fruit; though possibly cognate with 
Breton’s Island-Carib “im” and ‘“-im” or 
““ijm”’, which he translated in different 
contexts by fils, fruit, graine ou semence, 
oeuf; and which, as hj, -{ of the modern 
Central American dialect of that language, 
still has the same range of referents. 

In the earlier note referred to above, I 
concluded that Lokono -wasini or wasina 
heart and its equivalents in Island Carib 
(nisi), Goajiro (-aj), Guinao (inisini), Piro 
(-wahi), Saraveca (-vihine), Maipure 
(-nikini) and Manao (-niki or -neki) were all 
cognate; but I am bound to reconsider this 
position after finding the Herrnhuter 
author’s Lokono “Ansi, Anissi, 4nsiwa’”— 
undefined, but apparently referring to the 
non-physical heart, as in the example of its 
use: ‘‘jura ddnsi ich verlange’’,—more lit- 
erally perhaps, my heart (dansi) urges or pulls 
(jGira). As in the case of inisi testicles and 
isini- penis, so Lokono an(i)si(wa) heart = 
own mind or soul or desire may well contain 
or have contained the same morphemes, in 
reverse order, as -wasini or -wasina heart = 
organ of the body of the same language. 
But they are nevertheless different, ap- 
parently non-interchangeable constructions; 
and the whole problem is complicated by the 
existence of other possibly related Arawakan 
forms like Piapoco inasi and Ipurina isini, 
both meaning flesh. 

In another note in this journal (25.195-96), 
I tentatively concluded that Island Carib, 
denominal attributive “kinchin cher, aimé” 
and privative “minchin”’, its antonym, 
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whose stem became -ise or -isen in the recent 
Dominican dialect and -isie in that of 
Central America, were cognate with Lokono 
“kanissin lieben” and “mansin nicht lieben, 
hassen”, but had been reshaped under the 
influence of borrowed Karina ise to want or 
to love. i still think this influence most 
likely so far as the later developments are 
concerned; but in view of the occurrence in 
recent Dominican of such a form as: kanisiti 
nuni J like it—more lit.: it-is-agreeable to-me 
(containing Anisi heart, inclination or mind), 
besides khiseti ntini (containing -ise), with 
much the same meaning, I now find it more 
plausible to compare 17th-century Island- 
Carib -jsj or -insin (the status of nasalized 
vowels in the earlier recorded stages of this 
language is uncertain) with Guinao inisini 
heart and—semantic difference notwith- 
standing—Lokono inisi testicles than with 
Lokono 4n(i)si and the forms meaning love 
and hate as cited above. It is of course not 
inconceivable that Island-Carib -insin and 
Anisi, Lokono inisi and 4n(i)si (whence 
-anisin), should have resulted from the 
splitting of what once were the same mor- 
phemes in the same construction; but it 
seems clear that these forms were quite 
distinct in both languages at the time of our 
earliest records. 

Moreover, Lokono -dn(i)si and Island- 
Carib -A4nisi—with which compare Goajiro 
-ai (<*-ani) life, heart, soul; a form that 
enters into expressions of love or desire— 
appear to contain -Ani own (property or 
concern), a stem found in both the former 
languages,‘ together with -si seed. And if 
this should be so, it is easy to understand 





4 So Lokono: “‘ani es gehért, ist eigen; tuhu dani 
das gehért mir; kia bani je was dir; tain (ta ani) 
nach seiner Art;...’’; and Island-Carib: “ani 
chose’, “‘kat ani lika? d@ qut est-ce que cette chose 
appartient?’’, ‘‘nani-kia c’est d moy’’, ‘“‘nani mhem- 
kia j’en ay affaire’’, ‘‘ao kani lika je prend celd’’. 
In both languages it may be verbalized; so 
Lokono: “anin tun, machen’”’; and Island-Carib: 
‘‘anf-bae lika fais cela’’. That it denotes, in Island 
Carib as in Lokono, property rather than thing may 
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how such a compound might be employed 
in reference to the heart—either as a mental 
and emotional essence, as a bodily organ, 
or as both. 

It is more difficult to reconcile the se. 
mantic aspects of the other pair of forms 
that I have assumed, perhaps wrongly, to 
be cognate. Breton’s “kinchin cher, aimé” 
may, in all likelihood, safely be analysed as 
containing the attributive prefix kA-, 4 
subordinative suffix -n, and a nominal stem 
-insi- which seems to mean dearness. But is 
the latter an etymologically simple form? 
If it is really cognate with Lokono inis 
testicles and (or) Guinao iniSini heart, it 
most probably contains -si seed, etc., to- 
gether with a morph that I can best compar 


to Breton’s Island-Carib “im’’, hi and -j df} 


the modern Central American dialect, and 
which might here be glossed kin, kind, kin. 
dred, or still more plainly, progeny, profit; 
(but ef. also Lokono uina, -ina and Island. 
Carib iuna, -ina, both meaning bas, 
foundation, stem). 

Lokono akusi eye(s) and akuke eyelid(s 
appear to be compounds containing a cog 
nate of Island-Carib 4ku and Goajiro a?u, 
both meaning eye(s), together with, respec. 
tively, Lokono -si seed and eke container, 
covering. But can this cognate be identified 
with the only similar free form listed in the 
Herrnhuter Worterbuch: ‘aku in, an, auf, 
tiber’’? 

It seems plausible—though _ perhap: 
unlikely—that Lokono and Island-Canii| 
mArisi maize should be a compound of thos; 
languages’ mari toothless or, at least in the 
former language, blunt (without edge a 


be gathered from Breton’s own examples of its 
use, or from the following, which were recorded 
by me among the Black Carib of Central America 
bdni lira that belongs to thee, that (is) thy property, 
that (is) thy affair; or verbalized as in: hanf-ba 
(idragie) take it (from there). With the basal suffi 
-(i)na of this dialect (v. IJAL 22.14), it mean 
cause, as in: taénina iquari the cause of trouble, th 
root of evil (ifuari evil, wrong); and the verb dnidi 
means to own, to possess ‘en propre’. 
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point), from 4ri tooth, teeth, together with 
Lokono -si seed. Another possible etymology 
for the first part of this name is Lokono 
mari (Island Carib miri) nameless, from fri 
name of both languages. 

But although -si seed, etc., remains as a 
Lokono stem, there are indications that it 
has sometimes degenerated, even in that 
language, to a suffix, if not further. So, 
compare Lokono and Island-Carib_balisi 
ashes with abalin- ‘“‘voriiber gehen, weggehen”’ 
of the former language, bali-ba ‘“‘torche, 
nettoye’”’ of the latter, and with Piapoco bali, 
Goajiro pari, both meaning ashes; or Lokono 
and Island-Carib -ubilisi dust, spray, chaff 
(according to context) with abuledin- 
“wegwerfen” and abuledunua “eine Quelle’ 
of the former language, buli- spring up out of 
and A4bulikani ‘‘source’’ of the latter. 

In Island-Carib -is{pisi-fra gall, containing 
sipi- bitter and -fra juice, the morph -si can 
have none but a purely grammatical func- 
tion (nominalizer?), with which Lokono 
itself dispenses in its isipe gall, from sipe- 
bitter. Island-Carib (n)4risi and Lokono 
(d)adiki, both meaning after (me) or (my) 
trace or footprints, most probably contain 
the same, cognate morphemes as are seen in 
Island-Carib 4ri navel and Lokono adi upon, 
over (in addition, more, apart); but there are 
no grounds for supposing that -si and -ki 
of these words are cognate, though they 
may well have the same function or signifi- 
cance. 

It may seem unreasonable to refuse to 
recognize a correspondence between sibilant 
and stop of Island Carib and Lokono (ef. 
also IC asika and L atika, both meaning 
dig) in view of the fact that Island-Carib 
anisi, -drisi, balisi, ébesi (payment, recom- 
pense) and -isipisi-ira of the Dominican 
dialect have become: anigi, -drigi, balfgi, 
ebégi and -igifigiré in that of Central 
America. However, in the case of the Island- 
Carib dialects we have evidence of the 
splitting, in St. Vincent, of s into s and ¢ 
(anici, ébeci), and of the all but universal 
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replacement by analogy, in Central America, 
of /ci/ by /gi/; whereas such apparent 
correspondences between Island Carib and 
Lokono are few and sporadic (see my note 
on the history of Island-Carib consonantism 
in IJAL 26. ff.). 

Island-Carib arikai and Lokono adike, 
both meaning ear(s)—that is, the external 
appendage so called; there being other words 
for the organ of hearing as such—most prob- 
ably contain IC ari- and L adi-, as in the 
forms meaning after (see above), together 
with morphs belonging respectively to 
Island-Carib 4kai and Lokono eke container 
or covering. Cf. Lokono akuke eyelid(s), 
mentioned above; de Laet’s “saeckee”’ mother, 
recorded in 1598, modern -sake uterus, egg- 
shell, containing -sa child, egg; and Island- 
Carib uldkai belly, containing, according to 
Breton, tile food. 

Island-Carib (h)urira and Lokono iditira 
milk are transparent compounds containing, 
respectively, uri and idiu breast together 
with -fra and -éra juice; these second mem- 
bers being found also in Island Carib Aku-ira 
and Lokono ikira, both meaning tear(s). 
In its two stresses and lack of vowel re- 
duction, the latter Island-Carib form is more 
like a syntactic construction (“eye juice’) 
than a compound; but in view of Lokono 
akuke eyelids (see above), we can hardly 
accept the first member of Lokono ikira 
tears as an allomorph of aku-. This morph 
apparently occurs again in Lokono -ikiti, 
ikitihi eyelash(es) (cf. Island-Carib A4ku-fu 
eyelashes, containing fu “poils, épines’’), 
for whose second member see footnote 1. It 
seems possible that this iki- may be related 
to “ikin regnen” and (or) to “ikin einzig’’, in 
reference to the distinct nature of the sev- 
eral tear-drops and eyelashes (cf. also -ikisi 
count, reckoning, etc.). 

Evidently a case of not seeing wood for 
trees is my own blindness, hitherto, to the 
full significance of Taino ‘arcabuco’ forest 
as a presumable cognate of its Island-Carib 
equivalent, arabu (Dominican) > drabu 
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(Central American). This significance is 
emphasized by a comparison of Island- 
Carib aridbu (Dominican) > driabu (Central 
American) with Lokono orikahu, both 
meaning night. In ascribing to these Island- 
Carib words and to Lokono unidbu water 
the status of etymological compounds (IJAL 
25.192), I was, it seems, quite right; but the 
historical complexity of the forms now ap- 
pears greater than I then perceived. 
Island-Carib ardbu forest < _ ar(a)- 
(Lokono ada) tree or trees + -kabu comprising 
+ -ku at, with loss of the dorsal stop in the 
second member and omission of the third. 
This -kabu is no doubt cognate with the 
homophonous suffix of early Island Carib 
and with ‘propensive’ -habu of the modern 
Central American dialect (numbered 323 
in IJAL 22.4 & 14); and most probably 
enters, as stem, into Island-Carib tikabu 
(Dominican) > thabu (Central American) 
hand(s), Lokono “Uekkabbu die Hand, 
insonderheit das Vordertheil, die Finger’’. 
When we compare the latter part of this 
citation with the same author’s “Abtttun 
anfassen, greifen, halten’’, and with Island- 
Carib dbuta to pick up, to gather or take in, 
it becomes clear that these verbs and the 
nouns meaning hand alike contain a form 
abu with (bearing, holding, in possession of), 
common to both languages, and itself having 
the grammatical behaviour of a noun (v. 
IJAL 22.36). The form kabu taking in, 
comprising, which must have entered also 
into Island-Carib aridbu night and Lokono 
unidbu water, is therefore itself complex, 
consisting of the attributive prefix kA- 
together with the nominal stem -abu; (ef. 
Goajiro aih < ali nzght, and Lokono tin, 
tini or uni river, rain, water). The final 
syllable of Taino ‘arcabuco’ is not found 
with the Island-Carib and Lokono forms 
cited here; but unless it should be an in- 
dependent suffix, it may belong to the same 
morpheme as Taino ‘-aco’ eye and be cognate 
with Lokono aku “in, an, auf, tiber (‘ain 
tiber’ sonderlich vom Feuer und Wasser), .. ; 
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akulukku in (wird vom Feuer und von den 
Augen gebraucht), ..”’; ef. also Island-Carib 
aku eye(s), Goajiro -a?u eye(s) and -a?u on, 
Lokono akusi eye(s); or Lokono abu- have, 
but kabukii- ‘“‘kriegen, bekommen’’. 

Taino ‘conuco’ cultivated plot or provision 
ground is certainly cognate with Lokono 
“Kinnuku der Busch” and Goajiro unu?y 
tree(s); but it seems very doubtful whether 
these words can be related to Lokono 
“akunnuku zu gehen’’. 

Dominica, British West INpins 


Rate or Morpuemic Decay 
IN Meccan ARABIC 


ARNOLD C. SATTERTHWAIT 


The validity of the theory of lexico- 
statistical dating depends on the hypothesis 
that the percentage “of the basic-root- 
morphemes of a language which will survive 
as cognates after 1000 years zs constant for 
all languages at all times.’ If two different 
datable time-periods in the development of 
the same language can be determined, the 


percentage of the basic-root-morphemes of! 


the language which have survived per 
thousand years; i.e., the rate-constant of 
the formulas, can be found. Insofar as the 
rate-constants for the different languages 
are approximately the same, the theory is 
confirmed. Investigation of the validity of 
the theory involves the examination of as 
many languages as possible to determine 
the rate-constant of each language. The 
validity of the theory depends upon the 
degree of agreement between these in- 
dividual rate-constants. 

Meccan Arabic lends itself ideally to this 
type of investigation to which originally 13 
languages were subjected.? This dialect 


furnishes a datable body of material, the! 


Qoran, which was revealed to mankind 


1 Robert B. Lees, The Basis of Glottochrondl- 
ogy, Lg. 29.119. The italicized words were inserted 
for this paper. 

2R. B. Lees, op. cit., 118-119. 
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by an identifiable Meccan, the prophet 
Mohammed, between 610 A.D. and his 
death in 632. The text was standardized by 
a second Meccan, ‘Uthman ibn ‘Affan, 
between 645 and 650. It is generally agreed 
that the consonantal text of the Qoran has 
undergone little change since that date. 

In the following note the rate-constant 
for Meccan Arabic is obtained by com- 
paring the Arabic of the Qoran with the 
Arabic spoken by contemporary Meccans. 
For purposes of this investigation I have 
assumed that the Arabic of the Qoran repre- 
sents the particular Meccan dialect of 
Arabic in 632 A.D. from which the dialect 
spoken by Meccans today is derived. The 
rate-constant obtained by this comparison 
will be compared with the mean rate- 
constant derived by R. B. Lees, .8048 + 
0176/mill.* and the more recent mean 
rate-constant established by Morris Swadesh 
at about 6% above Lees’ through further 
refinement of the method.‘ 

In order to estimate the validity of a word- 
list it is necessary to know how it was 
formed. I established the Qoranic word-list 
as follows. (1) I collected all Arabic words 
which I recognize as being equivalent to 
Swadesh’s revised list of 100 semantic 
items. (2) Checked this list against a 
Qoranic concordance.*® (3) Generally selected 
the most frequently occurring item as the 
Qoranic equivalent of Swadesh’s semantic 
term. Only occasionally did I investigate 
the exact meaning of a word by reference 
to the Qoran. In several instances this in- 
vestigation led to the rejection of an item 
which occurred more frequently. 

This method of obtaining a word-list 
can be expected to skew the results a bit. 
Words with which I am not familiar have, of 


5R. B. Lees, op. cit., 119. 

‘Morris Swadesh, Towards Greater Accuracy 
in Lexicostatistic Dating, IJAL 21.123. 

Sop. cit., 1383-137. 

6M. F. Barakat, al-Murshid ila Ayyat al- 
Qur‘an al-Karim wa Kalimatiha, al-Matba‘ah 
al-Hashimiyah, Damascus, 1939. 
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course, not been considered. Such words 
might possibly occur more frequently in the 
Qoran than those known to me. The result 
of omitting these words from consideration 
would be a tendency toward a higher re- 
tention rate than should actually be the case. 

The following list includes all the items 
investigated with the results tabulated. The 
first word in each instance is the one se- 
lected as the Qoranic equivalent. The num- 
ber in parentheses following each item 
indicates the number of times it appears in 
the Qoran. For high-frequency items I have 
indicated only the number of columns in 
the concordance, for example (4C), needed 
to list all occurrences. (N) indicates the item 
occurs but is not listed. Items followed by 
no number were checked but do not occur. 

1. all kull (47) ?ajma‘ (22) kaaffatan (5) 
jamiy‘ (3) qaaTibah, saa?ir, 2. ashes HuTaam 
(3) rumaad (1) hamiyd (1) Hadiyd sharp 
(2) iron (5) Hadiydiyyah, sakan, dardaar, 
xaraabah, tlw, jmr, fam, 3. bark none, 
jid* trunk (8) giraa? veil (2) qisr, liyf, laHaa? 
jilaaqah, 4. belly barn (4) womb (8) valley 
(1) plus 7 additional occurrences not in- 
vestigated, jawf (1), ma?nah, kars, karis, 
ma‘idah, 5. big kabiyr (42) ‘adiym mighty 
(more frequent; rejected on meaning), 
samiyn (3) jasiym, jaliyl, ‘awd, pxm, 6. 
bird vayr (18) Taa?ir (1) ‘usfuwr, 7. bite 
‘app (2) grp, garraD, srk, prs, qazqaz, jazjaz, 
nh’, 8. black ?aswad (2), 9. blood damm (7), 
10. bone ‘adm (2), 11. breast sadr (many 
times), nhd, bzz, zaan, @dy, 12. burn ?i 
Htaraq (1) Sbb, 18. claw none, xlb, pfr 
mnésr, DIf, muxbaat, 14. cloud siHaab (9), 
gym, nw”, 15. cold baarid (2), 16. come ?ataa 
(15C), jaa? (1044C), Hapar (1C), 17. die 
maat (2144C), tuwuffiy (144C) 18. dog kalb 
(6), 19. drink Sarib (13), skr (5), 20. dry 
yaabis (2), jaaff, n&f, 21. ear ?udun (16), 
sam‘ (12 at most), masma‘ (2), 22. earth 
?arp (many) turaab (2/5C), 23. eat ?akal 
(446C), Ta‘im (2/3C), 24. egg none, bayp, 
djj, 25. eye ?absaar eyes (most but not by 
much) ‘ayn eye, 
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26. fat Sunuwm (1) dihn, samn, 27. 
feather none, riyS splendid garments (1), 
ziynah adornment, janaaH crime, wing, jild 
skin, 28. fire naar (most), Hariyq (3), Daw”, 
?abbuwliy, 29. fish none, samak, Hawt, 30. 
fly Taar (2), saafar, nafar, 31. foot rijl (14), 
qadam (6), ?jj (2), ?rj, kff, 32. full ?i mtala? 
(1), muli? (2), mu‘abbaa, matruws, 33. give 
°a'taa (10), ?dy (6), ?ahdaa, 34. good 
tTayyib (35), Hasan (17), zayn, kuwayyis, 
maliyH, maniyH, SHw, 35. green ?axpar 
(occurs), 36. hair ?aS‘aar (1), riyS splendid 
garments (1), 37. hand yad (4C), kff, 38. 
head ra?s (1C), 39. hear sami‘ (4C), 
tasamma‘, 40. heart qalb (414C), fu?aad 
(3/4C), 41. horn none, garn (0 except in 
name of Alexander the Great), ruwq, nafiyr, 
Sabbuwr, buwq, 42. J ?ana (N), 43. kill 
qatal (most frequent), SabaH sacrifice (oc- 
curs), 44. knee none, rukbah, kuw‘, zaawiyah, 
45. know ‘alim (13C), ‘arif (1C), daraa 
(1C), 46. leaf waraqah (2), xuws, 47. lie 
?i ttaka? lean (1), recline (1) munqa‘ar 
uprooted, naam sleep (5), raqad sleep (1), 
lawaaqiH rainbearing winds (1), qa‘ad sit 
(7), musannad leaned against a wall (1), ?i 
nsadaH, ?i nbataH, ?i praja‘, ?i nDaja‘, 71 
stalqaa, tasatran, tamaddad, slqw, slqy, 
jp‘, 48. liver none, kbd, Tunaal, qasbah, 
sqwdaa’?, 49. long tawiyl continual (1), 
livelong (1), madiyd, mustatiyl, 50. louse 
none, gummal tick (1), jiran duwdah 

51. man rajul (54, including plural), 
?insaan (64, singular only. Rejected without 
inspection of meaning. I understand it to 
mean person), baSar mankind (36), mar?, 
?{ mra? (10), rajjaal, maxluwq, ?aadamiy, 
zalmah, zamlah, Salmah, Samlah, 52. many 
kafiyr (5C), waajid, 53. meat lanm (12), 
54. moon qamar (1C), 55. mountain jabal 
(1 2/3C), vali‘, jaarib, jawb, 56. mouth 
faahu, ?afwaah (13), damm, xaSm, Halq, 
buwz, 57. name ?ism (4C), ?alqaab (1) 
58. neck ‘unuq (2), raqabah captive, ‘itrah, 
59. new jadiyd (6), Hadiy@ (less), axiyr, 
60. night layl (3144C), 61. nose ?anf (2) 
xaSm, minxaar, manxuwr, naxarah, napif, 
nadif, 62. not maa (N. selected as probably 
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the most general of a number of negative 
63. one waanid (2C), 64. person ?insaan (fj 
counting singular only), rajul (54), bai 
mankind (36), nafar (3), fulaan ( 
?aadamiy, Saxs, zawl, zamlah, zalmah, §j 
rain matar (6), Sitaa? (1) ?aHyaa, Hayag’ 
Haya, list, 66. red Humur (2) bunniy, (j 
road sabiyl (7C), hudan (2 2/3C), siray 
(144C), vTariyq (5), Trariygah (2), Taraa’ 
(3), darb, rastah, mamarr, jaaddah, Saar! 
sikkah, xatr, zqq, 68. root ?asl (2), say 
leg (3), ji8* trunk (3), ‘urq, jidr, Sur’, Slum 
69. round none, daa’irah change (3), muday: 
wir, mudawwar, mustadiyr, mupbaygan 
kurawiy, murabba‘, 70. sand turaab (2/3() 
raml, baru, 71. say qaal (many), 72. » 
ra’aa (1019C), nadar (214C), 71 Tralat (3 
Saahad, Saaf, ‘ayyan, Taala‘, 73. seed nabi 
grain (5), Surriyyah offspring (1144C), bidrah 
Siyb, 74. sit qa‘ad sit (6) sit at home (Il 
jalas, 75. skin jild, juluwd (9), badarab, 

76. sleep nawm (3), naa?im (2), ruqui 
(1), 77. small sagiyr (8), 78. smoke duxxax 
(2), duxnah, ‘ajaaj, 79. stand qaam (14,( 
waqaf (3), 80. star nujuwm stars (13) kawka 
(6), 81. stone Hijaar (11), Hajar (10), saxari) 
(3), Hasaah, Haswah, wayd, 82. sun Sam 
(occurs), 83. swim none, ‘wm, saban, Sarn, 
*i stanamm, 84. tail none, Sayl, Banal, 
liyyah, 85. that Saalika (N), 86. this haatu 
(N), 87. thou ?anta (N), 88. tongue lisaw 
(1C) 89. tooth sinn (2), pDirs, muqaddin 
90. tree Sajarah (1C), 91. two 7i Onap 
(2/3C), 92. walk masiy (20), saar traw 
(18), 93. warm (hot) Haamiy (3), Harr hes 
(3), muSawwab, Haarr, suxun, saaxin, $4 
water maa? (2C), 95. we nannu (N), 9 
what maadaa (N), 97. white ?abyap (i) 
98. who man (N), 99. woman 71 mra?ab 
(1C), nurmah, sayyidah, ?insaanah, 100 
yellow safraa? (1) 

The resulting Qoranic word-list contains 
total of 88 items which I have compare 
with word-lists obtained from three Meccan 
at present living in eastern Saudi Arabia) 





In eliciting the vocabulary which make! 
up the word-lists of the three informants) 


. 7 j . 
Arabic was the language of communication} eid 
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gatives 
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The following elicitation techniques were 
used. (1) Obtained the noun desired by 
pointing. (2) Used pictures. (3) Acted out 
verbs where possible. (4) Used word- 
analogies such as “What on a bird corre- 
sponds to the hair on a man?” (5) Words 
were obtained through observation of the 
informant’s speech. (6) Opposites were 
requested. (7) Definitions, such as “What 
is the animal which travels in the air?” 
were used. 

In general, where the informant offered 
more than one word, I accepted the first if 
in my experience it was a word which was 
equivalent to the meaning I was seeking. 
At the beginning of the series of investiga- 
tions, I asked for both classical and col- 
loquial alternates. I found quickly that the 
meaning of ‘‘classical’’ varied radically from 
informant to informant, therefore I soon 
ceased to ask for both. Those words listed 
as classical alternates I have rejected. I 
also rejected words with a specialized mean- 
ing and words I inadvertently directly 
suggested to the informant. 

After I had collected the three word-lists, 
I made up a composite word-list by including 
in it all words on which two or three con- 
temporaries agreed. In one case, where no 
majority prevailed, I omitted the item. 

One semantic item taken from the follow- 
ing word-lists will illustrate the method of 
presentation. The Qoranic equivalent, if it 
occurs, is always given first. The code- 
number (4) indicates the composite word- 
list defined above. 

33. give a. 2a ‘taa (034), ‘araa (1), b. 
*ahdaa (2) R. ?addaa (8) (2nd choice). 

The four sources yielded four alternates 
divided into two mutually exclusive cognate- 
sets, a and b, and one rejected item, R. 
Cognate-set a consists of ?a ‘“raa obtained 
from the Qoran (0) and informant 3. This 
itm I have selected to represent the 


' composite-word for contemporary Meccan 
|} as is indicated by (4). Informant 1 said 


‘aTaa, cognate with ?a‘raa, and informant 2 


said ?ahdaa, not cognate with the first two 
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words and so belongs to cognate-set b. 
Informant 3 added ?addaa as an after- 
thought. I therefore rejected this item. 

KEY TO COMPARATIVE LISTS: a, b, c, d: 
mutually exclusive cognate-sets. (0) Qoranic 
word-list. (1, 2,3) word-lists of contemporary 
Meccan informants. (4) word-list of com- 
posite contemporary Meccan. R, rejected 
items. 

1. all a. kull (0234) b. jamiy‘ (1), 2. ashes 
a. HuTaam (01) b. ramaad (2) rumaad (34) 
R. bagaaya ashes of a cigaret (1), xaraaba 
(1) third choice, 3. bark——(0), a. giSir (2) 
gisrah (14) b. linaa? (3) 4. belly a. barn (0) 
bain (1234) R. kirS (3) 2nd choice, 5. big a. 
kabiyr (01234), 6. bird a. rayr (01234), 7. 
bite a. ‘app (0134) b. yaprus (2) R. yigazgiz 
(2) 2nd choice, 8. black a. ?aswad (01234), 
9. blood a. damm (01234), 10. bone a. ‘adm 
(0) ‘admah (1) ‘adum (234), 11. breast a. 
sadr (0124) sadur (8), 12. burn a. ?i Htaraq 
(01) yinrig (2) %a nuaraq (84), 13. 
claw——(0), a. Safr (14) Sifir (3) b. dalf (2), 
14. cloud a. sinaab (01234) R. gaym (3) 
2nd choice, 15. cold a. baarid (01234) R. 
buraad (8) 2nd choice, 16. come a. ?ataa (0) 
b. jaa? (134) ¢. Hapar (2), 17. die a. maat 
(01234), 18. dog a. kalb (01234), 19. drink 
a. Sarib (0) Sirib (1234), 20. dry a. yaabis 
(0) b. naadsif (124) nasSaf (3) R. gaaff (2) 
2nd choice, 21. ear a. ?uSun (0) 7i3in (1234) 
R. widniy (3) 2nd choice, 22. earth a. ?arp 
(01234), 23. eat a. ?akal (01234), 24. 
egg (0) a. bayp (3) baypa (124), 25. 
eye a. ?absaar eyes (0) b. ‘ayn (134) ‘aynu 
(2). 

26. fat a. Suuuwm (0) SaHam (1234), 27. 
feather——(0) a. riySah (134) riyS (2), 28. 
fire a. naar (01234) 29. fish——(O), a. 
samak (234) b. Howr (1) R. nowt (3) 2nd 
choice, 30. fly a. Taar (0134) yiriyr (2), 31. 
foot a. rijl (0) b. qadam (2) gadam (134) R. 
rijil (1) 2nd choice, “colloquial and most 
common.” 32. full a. ?i mtala? (0) malyaan 
(1) mal ? aan (2) ?a tmalla? (34) R. Sab‘aan 
full of food (1) 2nd choice, 33. give a. ?a‘raa 
(034) ‘araa (1) b. ?ahdaa (2) R. ?addaa (3) 2nd 
choice, 34. good a. Tayyib (01234), 35. green 
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a. ?axpar (01234), 36. hair a. ?a3‘aar (0) 
Sa‘ar (124) Sa‘ir (3), 37. hand a. yad (01234), 
38. head a. ra?s (0) ra?as (2) raas (14) Raas 
(3), 39. hear a. sami‘ (02) simi‘ (134), 40. 
heart a. qalb (01234), 41. horn——(0), a. 
qarn (134) garn (2), 42. J a. ?ana (01234), 
43. kill a. quatal (01234), 44. knee——(0) 
a. rukbah (1234), 45. know a. ‘alim (0) b. 
‘if (134) yi'rif (2), 46. leaf a. waraqah (0) 
waraq (3) waragat Sajarah (1) waraga (24), 
47. lie a. i ttaka? (0) b. tamaddad (234) ec. 
?i nsadaH (1) R. naam sleep (1), 48. 
liver (0) a. kibid (1) kibda (24) kibdah 
(3), 49. long a. Tawiyl (01234), 50. louse—— 
(012), a. gamil (34), 

51. man a. rajul (01) rajjaal (134) ragul 
(2), 52. many a. kafiyr (0234) kéiyr (1), 
53. meat a. lanm (0) lanham (1234), 54. moon 
a. qamar (012) gamar (34), 55. mountain 
a. jabal (0134) gabal (2), 56. mouth a. 
?afwaah (0) famm (1234), 57. name a. ?ism 
(0124) ?isim (3), 58. neck a. ‘unuq (0) b. 
raqabah (2) ragabah (134), 59. new a. 
jadiyd (01234), 60. night a. layl (0234) 
layla (1), 61. nose a. ?anf (0) b. xuSm (1) 
xuSum (234), R. ?anf (1) ‘classical’, (2) 
2nd choice, man... (1) First utterance, but 
not completed. When questioned, 1 said 
“manaaxiyr is Egyptian, understood in 
Mecca but not used.” 62. not a. maa (01234), 
63. one a. waaHid (01234), 64. person a. 
?insaan (0), b. Saxs (1234), 65. rain a. 
martar (01234), 66. red a. Humur (0) ?aHmar 
(234) ?anmir (1), 67. road a. sabiyl (0) b. 
tariyg (124) c. darb (3) R. sfilt auto-road 
(3) zugaag alley (3), 68. root a. ?asl (0) b. 
jizir (1) c. saaq (2) d. ‘irq (83)——(4), 69. 
round——(0) a. mudawwar (134) mustadiyr 
(2), 70 sand a. turaab (0) b. raml (1) ramal 
(24) c. barHa (3) R. ramla (3) 2nd choice, 
71. say a. qaal (0234) gaal (1), 72. see a. 
ra?aa (0) b. Saaf (134) Suft (2) R. ra?a (2) 
Said it was not colloquial, 73. seed a. Habb 
(0) b. bizrah (1) ec. dirra (24) dirrah (3), 
74. sit a. qa‘ad (0) ga‘ad (3) b. jalas (14) 
galas (2), 75. skin a. jild (01234) 

76. sleep a. nawm (0) naam (124) b. 
ragad (3) R. ragad (2) 2nd choice, 77. small 
a. Sagiyr (01234), 78. smoke a. duxxaan 
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(01234), 79. stand a. qaam (0) b. wugif (| 
wagaf (234), 80. star a. nujuwm (0) najimal 
(1) najim (2) najma (34), 81. stone a. Hijay 
(0) Hujaar(3) Hajar (124) R. suxuwr big rock 
(3), 82. sun a. Sams (01234) 83. swim——() 
‘aam (134) ya‘uwm (2), 84. tail (0) a 
danab (1) b. Sayl (2) dayl (34) R. liyyah taj 
of a sheep (1) danab (8) 2nd choice, 85. thats 
daalika (0) Saak (3) daak (14) haadaak (2 
86. this a. haadaa (0) haada (13) haada (24) 
87. thou a. ?anta (02) ?inta (34) ?int (1) 
88. tongue a. lisaan (01234), 89. tooth 4 
sinn (0124) sinna (3), 90. tree a. Sajarah (( 
Sajara (1234), 91. two a. ?i Onayn (0134 
Onayn (2), 92. walk a. masiy (0) misiy (134 
masa (2), 93. warm a. Haamiy (0) b. Haar! 
(1234) R. saaxin (2) 2nd choice, 94. wate) 
a. maa? (02) mayya (134), 95. we a. nammi| 
(0) nina (2) ?inna (134), 96. what a. maain| 
(0) maaza (2) b. ?ayS (134) R. ?ays (2 
2nd choice, 97. white a. ?abyap (01234); 
98. who a. man (0) miyn (1234), 99. womail 
a. ?i mra?ah (0) mara (2) b. Hurmah (134) 
100. yellow a. safraa? (0) asfar (1234) 

The formula log r = log C + t7 is applied 
to obtain the rate-constant r. ¢ is the tim 
between Mohammed’s death and 19% 
1.326 millenia. To support Swadesh’s ani 
Lees’ estimates that ‘‘on the average about 
81% of the basic-root-morphemes of a lan- 
guage will survive as cognates after 100 
years, for all languages, at all times,” thi) 
rate-constant for Meccan Arabic shoul 
fall somewhere between the .8048 + .0176 
mill. set by Lees and the 6% above Lew 











level set by Swadesh. In the comparison «' 


each contemporary word-list with the! 
Qoranic word-list the rate-constant r fo 
Meccan Arabic is found to be .823, com: 
pletely supporting Lees’ and Swadesh’ 


estimate. The data are tabulated as follows 
TOTAL NUM- 
INFORM- COGNATES BER OF ITEMS 

ANT RETAINED COMPARED C r 

1 68 88 .773 8 

2 68 88 .773 «£823 

3 68 88 .773 ~=—.82 

4 68 87 .782 «83! 





7 Swadesh, op. cit., 123. 
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This is an unexpectedly close result. As 
noted previously the method used in ob- 
taining the Qoranic word-list would be ex- 
pected to raise the value of the rate-constant. 
In addition the daily public recitation of 
the Qoran in mosques and schools through- 
out the Islamic world would lead one to 
expect that its vocabulary would have a 
higher retention-rate than almost any other 
vocabulary in the world. Such stability 
would tend to raise the value of the rate- 
constant still further above the mean found 
for other languages. Apparently these factors 
have not appreciably affected the rate- 
constant. 

The rate of morphemic decay in Meccan 
Arabic seems to confirm most strongly the 
stability of the morpheme decay rate for all 
languages while failing to support “the 
common view about the conservatism of 
the Semitic languages” mentioned by A. I. 
Samarrai.*® 

DHAHRAN, SAupI ARABIA 


ANNOUNCEMENT 


The Indiana University Graduate School’s 
Committee on Uralic Studies announces 
establishment of the first American publica- 
tion series in Uralic and Altaic studies. 

The initial volume—AMERICAN STUDIES 
IN Uratic Lineuistics—will appear in 
March 1960. It will centain twelve con- 
tributions from as many specialists in this 
field teaching or trained in such American 
universities as Columbia, Indiana, Kansas, 
Michigan, Texas, and Washington. The 
Foreword is by President Frederick Burk- 
hardt of the American Council of Learned 
Societies. 

The Uralic and Altaic Series is edited by 
Thomas A. Sebeok, with Fred W. House- 
holder, Jr., Felix J. Oinas, and Alo Raun 
serving as associate editors. Consulting 
editors are: John Lotz (Columbia Univer- 
sity), Samuel E. Martin (Yale University), 
Nicholas N. Poppe (University of Wash- 


8A. I. Samarrai, Rate of Morphemic Decay 


' in Arabic, IJAL 25.69. 
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ington), and Lewis V. Thomas (Princeton 
University). 

Manuscripts of books, monographs, or 
extended articles dealing with any subject 
in the humanities or social sciences, in 
reference to any of the Uralic or Altaic 
peoples or languages, may be submitted to 
the editor at 


Rayl House 
Indiana University 
Bloomington, Indiana, U.S. A. 


Inquiries about subscriptions may be 
directed to the same address. 
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FOREWORD 


The present volume is the outgrowth of a panel discussion 
“Language and Culture Dynamics in South Asia” which was 
held at the annual meeting of the American Anthropological 
Association in Chicago, December 1957. That symposium 
was sponsored by the Committee on South Asian Languages of 
the Association for Asian Studies and was organized by Charles 
A. Ferguson, who was at that time Chairman of the Commit- 
tee. In the unavoidable absence of Professor W. Norman 
Brown of the University of Pennsylvania, who had hoped to 
preside, Ferguson also acted as chairman of the panel. 

At that session papers by Bright, Chowdhury, and Dimock 
were presented by the authors essentially in the form in which 
they appear in this volume. In addition Professor Uriel 
Weinreich of Columbia University spoke on “The Troubles of 
Hindi” and McCormack’s paper, which he submitted from 
India, was read by Bright. John J. Gumperz, the succeeding 
Chairman of the Committee on South Asian Languages, and 
Selig S. Harrison, Associate Editor of the New Republic, 
acted as discussants. 

Before publication of the papers, Weinreich’s study was 
withdrawn as being of somewhat different approach and more 
appropriate for publication elsewhere, and relevant articles 
by Pillai and by Naim and Gumperz were added to the group. 

At the invitation of Professor Thomas A. Sebeok of 
Indiana University the Committee agreed to publish the volume 
in the format of a Memoir of the International Journal of 








American Linguistics (IJAL), with additional distribution as a 


supplement to the journal Indian Linguistics. 

Ferguson and Gumperz undertook the editing of the papers. 
They made some attempt to bring the transcription, section 
numbering, and style of the various authors closer together 
without interfering with the individual preferences of the 
authors; in several cases they shortened the papers; and they 
provided the Introduction to the volume. They would like to 











express here their appreciation of the patience and helpfulness 
on the part of authors and publishers and to apologize for the 
long period of time which the editing process took. 


Charles A. Ferguson 
John J. Gumperz 
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INTRODUCTION 


Charles A. Ferguson 


Center for Applied Linguistics 


John J. Gumperz 


University of California, Berkeley 


0. Introductory 
l. Variety, language, dialect 
Geographical dialects 


Social dialects 


- ow ON 


Style and superposed varieties 


0. The diversity of languages in South Asia and some of 
its implications are often noted,! but relatively little attention 
is paid to the dialect diversity which exists within the individ- 
ual languages in the area. All the major languages of India, 
Pakistan, and Ceylon have regional dialects as well as dialects 
which correlate with differences of social position, and they 
also have special literary forms of language which differ to a 
greater or lesser degree from the ordinary colloquial. Such 
variation in language is a widespread phenomenon in the 
world — it might almost be called the ‘normal’ state of lan- 
guages —— but the exact nature of the dialect diversity in South 
Asia is not well understood, and its relevance to non-linguis- 
tic aspects of South Asian society and culture has never been 
systematically explored. The studies in this volume consti- 
tute a first attempt to describe and interpret the facts of 
dialect diversity in several specific languages of this part of 
the world. No great effort is made to carry the interpretation 
far afield from linguistics, but each of the studies contains 
suggestive material for the approaches of other disciplines in 
the study of contemporary South Asia. Since these studies 
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are addressed both to linguists and to non-linguists, the edi- 
tors have felt it necessary to provide in this introduction a 
brief summary of the theoretical framework which linguistic 
science currently offers for the treatment of intra-language 
variability. 


1. First of all, one of the fundamental problems of lin- 
guistics is the delimitation of languages as the ‘natural’ units 
of linguistic analysis and classification. Put more directly, 
one of the concerns of linguistics is to find a consistent, sys- 
tematic way to answer such questions as: Are these two 
particular varieties of human speech one language or two? 
Questions of this kind arise all over the world. Are British 
English and American English one language or two? Do all 
the Arabic dialects constitute one language or several? Fa- 
miliar examples from South Asia include: Is Hindi one lan- 
guage or several? Are Konkani and Marathi two languages or 
one? How many Dravidian languages are there?* Questions 
like this cannot be answered until some kind of definition of 
language is widely accepted as valid, or possibly until the 
concept of language as a unit is rejected and another formula- 
tion of the questions becomes accepted. 

Surprisingly little attention is paid to this problem, how- 
ever, and most definitions of language which are in vogue 
among linguists today are more concerned either with setting 
off speech behavior from other human activity or with setting 
off linguistic systems from other semiotic systems than they 
are with defining the limits of single languages. For example, 
linguists frequently define language as “a system of arbitrary 
vocal symbols by means of which a social group cooperates.”? 
This definition offers no clues whatever for the solution of the 
problem since two social groups may use the same language 
or the same social group may use two or more languages; and 
in any case the definition fails to specify ‘social group’ either 
in linguistic or non-linguistic terms. Accordingly, the best 
that can be done here is to summarize in a relatively crude, 
cover-all definition the uses of the term ‘language’ in this 
sense by reputable linguists. * 

In order to do this, one additional concept requires defi- 
nition, the minimal unit which serves as the ‘normal’ object 
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of linguistic description. A whole language is often too varied 
to be readily analyzed by current descriptive techniques, and 
descriptive linguists generally restrict their object of study 
with considerable care. Harris® says, “The universe of dis- 
course for a descriptive linguistic i~.vestigation is a single 
language or dialect.” Bloch® uses the term ‘idiolect,’ the 
“totality of the possible utterances of one speaker at one time 
in using the language to interact with one speaker.” These 
brief definitions, as the authors of them are fully aware, are 
not completely satisfactory. The idiolect concept for example 
is a necessary one, since in practice languages are always 
studied in terms of the speech of single speakers.’ As a unit 
of analysis however an idiolect is defined by extra-linguistic 
criteria, and homogeneity of structure is not a necessary re- 
quirement. Thus the speech of two individuals may be so very 
similar as to cause no difficulty in description while two styles 
of speech of one individual may be very different, and no one 
has shown how to delimit styles rigorously in the speech of 
one person or of many. 

At the present state of linguistics it is probably wise to 
sidestep the issues involved in devising a precise operational 
definition and to settle for a suggestive description. There is 
no term in regular use for this concept of the object of de- 
scriptive linguistic analysis, but the word ‘variety’ has been 
used by several authors in this sense and will be so used here. ° 
A variety is any body of human speech patterns which is suf- 
ficiently | homogeneous t« to be analyzed by available techniques 
of synchronic description a: and which has a sufficiently large 
repertory of elements and their arrangements or processes 
with broad e enough semantic scope to function in all normal 
contexts of communication. With this concept t clarified we 
may now attempt to define a language. 

A first approximation to the definition may be in terms of 
mutual intelligibility. Other considerations aside, two vari- 
eties of human speech are said to be the same language if 
each is readily intelligible to speakers of the other. The dif- 
ficulties of a definition of this kind are many, however, and 
are reminiscent of the difficulties of the biologist’s use of the 
possibility of interbreeding as the primary criterion in the 
definition of the species. 





























In the first place mutual intelligibility is difficult to 
determine’ (or even impossible, as with dead languages). Sec- 
ond, there seems to be little direct correlation between simi- 
larity of phonological or grammatical structure on the one hand 
and mutual intelligibility on the other. This lack of correla- 
tion is disturbing because linguists generally regard these 
structures as fundamental in human speech and often prefer 
to describe linguistic relationships in these ‘internal’ terms. 
Also, we frequently find varieties of speech which do not show 
high mutual intelligibility but which on totally different grounds 
we wish to regard as the same language, and vice versa. A 
well-known example is found in the varieties at the extremes 
of the German-speaking area which are quite different, with 
relatively low mutual intelligibility. They are connected, how- 
ever, by a series of intervening varieties which provide a 
gradual transition, and linguists generally agree in regarding 
all as varieties of one language. 

On the other hand, certain varieties are regarded as dif- 
ferent languages in spite of their high degree of mutual intel- 
ligibility. For example, there is a gradual transition between 
varieties of Dutch and German, but because of the existence 
of two relatively stabilized, widely-used standard varieties, 
Dutch (ABN) and German (Schriftdeutsch), two speech com- 
munities and hence two languages, are usually recognized; 
any transitional variety is assigned to the language whose 
standard form is accepted in the community, regardless of 
the internal linguistic similarities. 

Another commonly used criterion for differentiating lan- 
guages is that of difference in historical development. Two 
sets of varieties are said to constitute different languages if 
they differ significantly in their treatment of certain phono- 
logical and morphological features of a reconstructed parent 
variety. In contrast to the problems encountered in the de- 
termination of mutual intelligibility, historical relationship 
is relatively easy to establish by the rigorous and generally 
accepted methods of historical reconstruction. Grierson 
used this criterion in classifying speech distribution in what 
is now called the Hindi regional language area. He grouped 
the local varieties into five groups, which he calls languages: 
Bihari, Eastern Hindi, Western Hindi, Rajasthani, and Pahari. 
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Each group is set off from the others by peculiarities of gram- 
mar and pronunciation. Modern Hindi-Urdu is part of the 
Western Hindi group, but its historical relationship to the 
other local varieties is by no means always very close. Bihari 
is more directly related to Bengali, the regional language of 
Bengal; some forms of Rajasthani are usually classed with 
Gujerati to the south and the Pahari dialects are closest to 
Nepali, the language of Nepal. }° 

Classifications of the above kind are essential in histor- 
ical linguistics, and they provide valuable information for stu- 
dents of culture history. They do not, however, present all the 
facts of language distribution. In spite of the diversity of 
local speech forms and their relationship to neighboring re- 
gional languages in Grierson’s area, Hindi-Urdu is recognized 
as the language of literature and government throughout. A 
large proportion of the modern urban population and of the ed- 
ucated speaks it and in other ways regards itself as belonging 
to a single Hindi-Urdu speech community. Many modern 
writers thus disregard the historical relationships and refer 
to all local speech forms as varieties of Hindi.’ As in the 
case of the German- Dutch area, the presence of a superposed 
variety, such as a literary standard, is an important factor 
in the determination of language boundaries. This factor is 
irrelevant, however, if the standard language in use in an 
area is extremely divergent or obviously unrelated to the va- 
riety to be classified. All this brings us to our working defi- 
nition. 
A language consists of all varieties (whether only a single 
variety or an indefinitely large number of them) which share a 
single superposed variety (such as a literary standard) having 
substantial similarity in phonology and grammar with the in- 
cluded varieties or which are either mutually intelligible or 
are connected by a series of mutually intelligible varieties. 

It is evident that, as with the term idiolect, purely lin- 
guistic criteria are not currently accepted as sufficient for 
the delimitation of a language. Rather, a given set of varie- 
ties must meet certain minimum linguistic conditions (e. g. 
structural similarity within the set, structural difference 
from varieties outside the set) as well as certain sociological 
conditions (e.g. use of standard, speakers’ feeling of belonging 
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to speech community) in order to be regarded as a single 
language. 

When the concepts of ‘variety’ and ‘language’ have been 
explained it is feasible to define the linguistic notion of ‘dia- 
lect,’ which is crucial to the present studies. It must be made 
clear at the outset that two conventional uses of this term are 
to be disregarded. One is the use of ‘dialect’ to mean an in- 
ferior language of some kind. This use of the term is still 
widespread in such expressions as: “Mr. A speaks six lan- 
guages and ten dialects.” “The inhabitant of that country 
speaks dozens of languages and innumerable dialects.” Since 
no linguistic criterion has ever been devised to differentiate 
superior and inferior languages, linguists prefer to operate 
without this distinction.'* The other is the use of ‘dialect’ to 
mean any non-standard variety of a language. This use is 
still current in expressions such as: “He doesn’t speak good 
German; he speaks only dialect.” “These children are learn- 
ing to speak the language without a trace of dialect.” “This is 
a dialect word not preserved in the literary language.” Many 
languages have one variety which is in certain respects dom- 
inant, the ‘standard’ variety, and it is often useful to distinguish 
between the standard variety and non-standard varieties of a 
given language, but linguists have found it more useful to leave 
the word ‘dialect’ colorless and equally applicable to standard 
and non-standard varieties. Thus linguists may refer to the 
‘standard dialect’ of a given language, and even if they con- 
tinue to use the traditional expression ‘standard language’ they 
generally regard this as a dialect, often, of course, the most 
important from some points of view. }3 

In the terminology of linguistics a dialect is, roughly 
speaking, something between a variety and a language. Some- 
times in the analysis of a language an obvious classification 
of varieties into dialects is clear to the linguist. More often 
a number of alternative analyses are possible, and any par- 
ticular classification into dialects runs counter to at least 
some features which could be used for classification. The 
elementary concept of dialectology is the isogloss. Any 
boundary which constitutes the limits of use of a particular 
linguistic feature is called an isogloss. For example, the 
word frappe is regularly used in the area around Boston to 
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refer to a drink consisting of milk, ice cream, and flavoring 
beaten together. In most of the United States this drink is 
called a milk shake, a term which in the Boston area refers 
to a similar drink made without ice cream. The boundary 
which shows the limits of the use of frappe in this sense, and 
which could be plotted on a map, is called an isogloss. In 
addition to isoglosses concerned with the occurrence of words 
in certain meanings (lexical isoglosses), linguists also make 
use of phonological and grammatical isoglosses. 

In classifying dialects linguists generally prefer to place 
emphasis on isoglosses (a) which cut boldly across large areas 
as opposed to those which are relatively local, or (b) which 
bundle together as opposed to those which are relatively iso- 
lated, or (c) which correlate with non-linguistic criteria of 
classification such as differences in material culture, religion, 
political units, and the like. 

Since dialect classification is largely arbitrary it might 
seem to be of relatively little value or even misleading, but in 
fact the delimitation of dialects is of great importance, both 
within linguistics proper and for extra-linguistic fields. In 
linguistic theory, a dialect may be regarded as the beginning 
of a linguistic split, as a step in linguistic differentiation. In 
other words a dialect is a potential new language, and the con- 
cept is comparable in validity and significance to that of the 
‘sub-species’ or ‘variety’ of biology. For the extra-linguistic 
importance of the dialect we need only note its obvious rele- 
vance to the units of other social sciences such as geographi- 
cal region, social class, or role. 

A dialect is any set of one or more varieties of a language 
which share at sia one feature or combination of features 
setting them apart from other varieties of the language, and 
which may appropriately be treated as a unit on linguistic or 
non-linguistic grounds. Because of the arbitrariness of this 
concept, linguists using the term ‘dialect’ for a particular 
language generally feel under obligation to explain and justify 
the criteria used for their classification in that language. * 


























2. Geographical dialects. The extent and nature of dia- 
lect differentiation in a language may be described in terms 
of two major correlates: density of communication and 
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inter-speaker attitudes. It cannot be maintained that these are 
the only relevant factors since the significance of such matters 
as the structure of the language (Sapir’s ‘drift’),'> interference 
of other languages, phonetic symbolism, or technological change 
can be shown, but on the whole these two correlates provide a 
satisfactory frame of reference for the analysis of dialect 
diversity. 

The assumption with regard to density of communication 
may be phrased in general terms as follows: Other things being 
equal, the more 2 frequently epeekere A and B of language X 











varieties of X spoken by = one will tend to tla identical. }® 

It follows from this that isoglosses will ‘tend to coincide with 
breaks or lines of weakness in communication.!’ Since such 
breaks or lines of weakness will often correlate with natural 
physical boundaries such as mountains or rivers, or with po- 
litical boundaries, or with economic limits (on the flow of goods 
and services), it is clear that a large proportion of dialect 
diversity will be subject to mapping. Possibly because of the 
effectiveness of cartographic representation of dialect differ- 
ences, this has been one of the most highly developed fields of 
work in dialectology. Dialect atlases have been published for 
many languages, and many special techniques of mapping have 
been developed. Apart from limited special studies most books 
in dialectology are either dialect atlases or grammars of non- 
standard varieties. 

In South Asia, surprisingly enough, there has been very 
little work of this kind. In the monumental Linguistic Survey 
of India Grierson offered sample texts and brief descriptions 
‘of many dialects and in some cases provided maps indicating 
what he believed to be the boundaries of major dialects, but he 
did not attempt dialect atlases.1* Almost all the work in South 
Asian dialectology published since Grierson has been based on 
his work, with only a very meager amount of original investi- 
gation, and Grierson’s volumes remain by far the most impor- 
tant source of data for the social scientist concerned with the 
distribution and dialect diversity of South Asian languages. }? 

None of the studies included in this volume is directed 
primarily to the analysis of mappable features of dialect dif- 
ference although all of them assume a background of regional 
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dialects. Accordingly, there is no need here for further dis- 
cussion of the concepts of dialectology concerned with shape 
and distribution of regional isoglosses, such as ‘relic area,’ 
‘fan,’ ‘bundle,’ ‘transition zone,’ ‘focus.’ The important thing 
is to note that all the techniques elaborated in the production of 
dialect atlases elsewhere in the world are available for use in 
the study of South Asian dialectology, and we may express the 
hope that significant studies in this field will be forthcoming in 
the next few years. 


3. Social Dialects. The assumption with regards to inter- 
speaker attitudes may be stated in terms of two processes. 
First: any group of speakers of language X which regards it- 
self as a close social unit will i tend to express its group soli- 
darity b by favoring those linguistic innovations which set it apart 
from other speakers of X who are not part of the group. ~ The 
existence within a speech community of social distinctions such 
as those of caste, class, professional guild therefore gives 
rise to differential rates of linguistic change, favoring the 
creation of new speech differences or the preservation of ex- 
isting ones. On the other hand: other things being aa if 


























and if A regards B as having more 2 prestige e than = and 
aspires to equal B's s status, then the variety of X spoken by 4 A 
will tend towards id identity with that s spoken by B. It may seem 
that this is a circular statement since one of the indications of 
the attitude of prestige is linguistic imitation but it is assumed 
that prestige relationships may be measured independently of 
language. As Leach puts it in describing a somewhat different 
multi-lingual situation: “For a man to speak one language 
rather than another is a ritual act, it is a statement about one’s 
personal status: to speak the same language as one’s neighbor 
expresses solidarity with those neighbors, to speak a different 
language from one’s neighbors expresses social distance or 
even hostility.””° 

The above two processes create a condition of equilibrium:” 
as old speech differences vanish, new ones arise with the ad- 
dition of population elements from the outside or through the 
formation of fresh social distinctions. The total range of 
speech diversity within a speech community, i.e. the actual 
language distance between all the varieties found there, isa 























10 


function of the density of intergroup communication. In soci- 
eties like our modern American one where social class is 
fluid and mass communication media are highly developed and 
shared by most, we would expect the range of variation to be 
relatively small. Differences would most frequently appear on 
the level of phonetics or the lexicon but rarely affect the fun- 
damental phonological or morphological structure of the lan- 
guage. In Asia, however, where intergroup communication is 
severely limited by ritual restrictions we would expect these 
differences to be much greater. 

The parallels between speech variation and social strati- 
fication are manifold. This field of investigation has been 
worked very little by the dialectologist, however, whether be- 
cause of the difficulty of presenting the data, or the problem 
of identification of social groupings, or simple historical ac- 
cident in the development of the science. A handful of valuable 
studies in this field do exist for various languages and show 
great promise for the development of sociolinguistic analysis, ** 
but until now very few such studies have appeared dealing with 
South Asian languages. *? 

Two of the papers in this volume have direct bearing on 
the problem of social dialects. Bright describes the speech of 
a Brahmin and a non-Brahmin speaker of the Mysore dialect 
of Kannada. Both are members of the western educated class 
and have had college education; the linguistic gap between them 
is nevertheless considerable. Of particular interest is the dif- 
ference in receptiveness to outside innovation. The non-Brah- 
min dialect has made a number of changes in the Old Kannada 
grammar and sound system but shows little influence from out- 
side borrowings. The Brahmin dialect on the other hand, pos- 
sibly because of the influence of the literary language, has been 
much slower to change the native Kannada patterns while it has 
been profoundly affected by borrowings from Sanskrit and Eng- 
lish. McCormack in studying the Dharwar dialect of Kannada 
finds three basic speech strata corresponding to the division 
between Brahmin, touchable non-Brahmin and untouchable 
castes. He also discusses the function of dialect differences 
as clues for the determination of a speaker's place in the 
caste hierarchy, a problem which has received considerable 
attention lately. “ 
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The significance of the linguistic studies for broader re- 
search is clear. For example the problem of quantifying caste 
difference in terms of behavior and attitudes is a central prob- 
lem in the analysis of South Asian social structure.** The 
limited amount of data we have already points to some inter- 
esting parallels. If we compare the dialect distribution de- 
scribed by Bright and McCormack and before them by Bloch 
and Aiyar with similar information on North India provided by 
Gumperz, we find that the southern speech communities (or at 
least those in the Kannada, Tamil and Tulu areas) show a fun- 
damental gap between Brahmin and non-Brahmin speech. In 
the North on the other hand (or at least in the Western Hindi 
and Punjabi and probably also in the Bengal area) Brahmin dia- 
lects do not seem to differ significantly from those spoken by 
other high caste touchables. This bears out other recent in- 
dependent anthropological findings regarding the difference in 
status between northern and southern Brahmins. 

The analysis of linguistic behavior in the context of social 
science offers a new dimension of measurement and one that is 
particularly accessible and quantifiable compared to many other 
aspects of behavior. There is every possibility that more de- 
tailed and better planned sociolinguistic investigation of this 
kind in South Asia will lead the way into a whole new area of 
the analysis of society. 


4. Style and superposed varieties. The third kind of 
variation tbo desenibed here is ditiet ent in nature from the 
preceding two. Geographical or social dialects tend to be 
mutually exclusive in the sense that the normal individual is a 
native speaker of only one dialect, and if he speaks more than 
one this is a sign of geographical or social transition of some 
kind on his part. But there are also linguistic variations which 
regularly coexist in the speech of individuals, with their use 
reflecting some kind of situational or role differences. 

It is part of the nature of language, or of human culture in 
general, that there should be individual differences in perform- 
ance and differences in the performance of a given individual 
in various situations. For example, a speaker normally has 
‘favorite expressions’ (e.g. words, phrases, constructions) 
which he uses more frequently than other speakers; and any 
adult speaker normally uses different kinds of speech talking 
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to his young children, participating in a religious ceremony, 
chatting with old friends, and explaining something to a stran- 
ger. To take somewhat different examples, in English the two 
words light and illumination have roughly the same meaning, 
but light tends to be used in less formal, less pretentious 
situations. Or, again, the words fore, aft, deck, companion- 
way have meanings very similar to those of front, back, floor, 
stairway, but the first set is generally used either in a mari- 
time setting or to suggest it, or by people who regularly live 
and work in such a setting. 

All languages seem to show variations of this kind. As is 
the case with social dialects, however, the exact range of these 
variations and the level of linguistic structure at which they ap- 
pear vary significantly. In America we find only minor dif- 
ferences in lexicon, syntax, stress and intonation, which do 
not materially complicate the task of linguistic analysis. Other 
parts of the world however show more fundamental variations. 
One of the most extreme situations of this type is that described 
by Garvin and Riesenberg on Ponape, where different sets of 
bound morpheme alternants and lexical items were used in 
social situations calling for honorific and non-honorific speech. 
Whenever two styles have striking morphological variation (e.g. 
case endings vs. no case endings) or extensive lexical pairing 
(e.g. pairs of different words for common items such as ‘head,’ 
‘go,’ ‘now’), or when both styles in question are regularly used 
under appropriate circumstances by a large segment of the 
speech community (e.g. all educated speakers of the language), 
then special language situations must be recognized which de- 
serve study and which clearly present theoretical and practical 
problems of quite a different nature. The investigation of 
stylistic differences of this kind is of interest to the linguist 
since they constitute a fundamental mechanism of language and 
their analysis provides an insight into more general aspects of 
speech dynamics. For other social scientists correlation of 
style with social situation, personality, and similar factors 
offers valuable clues to the understanding of non-linguistic 
human behavior. 7° 

Some styles of speech are sufficiently striking and exten- 
sive to require study as more or less autonomous marginal or 
partial systems within a language,?’ and often enough the styles 
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appropriate to certain occasions are full-fledged varieties of 
the language existing over and above the primary or ‘natural’ 
speech of individuals. There are many kinds of these super- 
posed varieties, the most important being the various types of 
standard languages. 

Very often one variety of a language, e.g. a certain local 
dialect, becomes widely accepted as standard and is learned 
by speakers of other varieties in addition to their native variety. 
Among typical examples of this standard language vs. local 
dialect situation we may cite important languages of Europe and 
Asia (e.g. German, Italian, Persian) where many speakers use 
a local dialect in familiar or ‘home’ situations but use the stand- 
ard language in more formal or ‘outside’ situations. In some 
speech communities there may even be several layers of super- 
posed varieties of this sort, with national and regional stand- 
ards in addition to local dialects. To complicate this picture 
there may also be regionally standardized pronunciations of the 
national standard as well as various mixed varieties appropri- 
ate to certain occasions or used by individuals who have not 
fully mastered certain levels.** This layer phenomenon is 
fairly frequent in South Asia and has recently been described 
for Hindi.*? Another kind of superposed variety is illustrated 
by the classical-colloquial dichotomy of Arabic, modern Greek, 
or Swiss German where the standard form of the language is 
used for written and formal spoken purposes only, the other 
for ordinary conversation.” The spread of a particular stand- 
ard variety is determined by social and political factors and 
often has little relation to the actual language distance between 
the standard and the local dialects. Thus, returning to our 
German example, we find High German used as a standard by 
many speakers whose native dialects are much closer to Dutch. 
Another interesting case in point is that of Southern Nepal 
where native speakers of Maithili, a dialect related to Bengali, 
are advodating the adoption of Hindi as a standard language. 
Other Maithili speakers across the border in Bihar are opposed 
to Hindi and favor the creation of a new literary standard based 
on medieval literary Maithili. 

Stylistic variation and superposition are important factors 
throughout South Asian society and four studies in this collec- 
tion are explicitly concerned with this kind of variation: Pillai’s 
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study of Tamil deals with a situation which is quite similar to 

the above mentioned cases of Arabic, modern Greek, and 

Swiss German. There are two distinct literary and colloquial 
styles, showing striking differences in phonology and morphol- 
ogy. The former serves as the sole medium of writing and 
formal speech making, while the latter is used for conversa- 
tion among the educated. Dimock deals with the two standard 
varieties of Bengali, where conditions are somewhat different. 
Sadhu Bhasa, like literary Tamil, is employed for writing and 
formal speeches. Colit Bhasa, the conversational medium of 
the educated, however, has become accepted as a vehicle of 
literature in its own right during the last seventy five years. 

It serves as the literary symbol of the newly arising urban mid- 
dle class, in contrast to traditional culture represented by Sad- 
hu Bhasa. Chowdhury describes the relationship between the two 
standards and the regional dialects of East Bengal and details the 
social situations in which they are used. He also points to the in- 
cipient development of a third Eastern Bengali standard variety, 
resulting from political conditions arising out of the partition of 
Bengal. The Naim-Gumperz paper deals with the Hindi-Urdu 
area where two literary varieties have developed from a single 
conversational standard and have become symbols of the com- 
peting Hindu-revivalist and Indo- Muslim cultural complexes. 


Notes 


1. Cf. S. Harrison, The challenge to Indian nationalism, 
Foreign Affairs 34. 620-36 (1956); M. Windmiller, Linguistic 
regionalism in India, Pacific Affairs 27. 291-318 (1954); R. 
Pieris, Bilingualism and cultural marginality, British Journal 
of Sociology 2. 328-39 (1951). The fullest factual account of 
South Asian languages is still G. Grierson, Linguistic survey 
of India (19 vols. Calcutta, 1903-28). A good summary is pro- 
vided inS. K. Chatterji, Languages and the linguistic problem 
(London, 1943). An extensive bibliography on politico-linguis- 
tic problems of South Asia is found in S. Harrison, The dan- 
gerous decades (New York, 1955). S. K. Chatterji, Indo-Aryan 
and Hindi (Ahmedabad, 1942) is a valuable historical treatment 
of one important area. 














2. For a discussion of the widely divergent statistics on 
such questions in South Asia, see J. J. Gumperz, Some re- 
marks on regional and social language differences in India, in 




















15 


M. Singer (ed. ), Introducing India in liberal education (Chicago, 
1957). 


3. Cf. B. Bloch andG. L. Trager, Outline of linguistic 
analysis (Baltimore, 1942). 


4. For valuable recent discussions of the whole language- 
dialect question, see A. Martinet, Dialect, Romance Philolo 
8.1 (1954) and C. F. Hockett, A course in modern linguistics 
Chs. 38, 39 (New York, 1958). Perhaps the best discussion of 
this problem in a broader sociological framework is that of H. 
Kloss, Die Entwicklung neuer germanischer Kultursprachen 
(Munich, 1952). 


5. Z. S. Harris, Methods in structural lin nguistics 9 (Chi- 
cago, 1951). 


6. B. Bloch, A set of postulates for phonemic analysis, 
Language 24.7 (1948). Other authors define ‘idiolect’ differ- 
ently. Cf. Hockett op. cit. 


7. Cf. Hockett, op. cit., 321. 


8. A. Kelkar in unpublished works has used for this the 
term ‘synlect’ based on the term ‘lect’ which is a key item in 
his comprehensive terminology for linguistics. 

















9. Some first steps have been taken in this field, and as 
better testing techniques and more satisfying methods of quan- 
tification are developed, measures of mutual intelligibility may 
become increasingly useful for linguistic descriptions. Cf. 

C. F. Voegelin and Z. S. Harris, Methods for determining 
intelligibility among dialects of natural languages, Proceedings 
of the American Philosophical Society 95. 322-9 (1951); J. E. 
Pierce, Dialect distance testing in Algonquian IJAL 18. 203-10 
(1952). 


10. Grierson, op. cit., vol. I. The details of Grierson’s 
classification are open to question on technical grounds but 
this does not invalidate the historical approach as such. 





11. See for example Dhirendra Varma, Hindi literature, 
in Literatures in modern Indian languages (Delhi, 1951). 
Much of the controversy regarding the linguistic affiliation of 
language units such as Maithili, Konkani, Bhili and others 
could be avoided if the distinction between language units de- 
fined by historical and superposed relationships were made 
clear. 








12. Non-linguistic criteria may of course be used to rate 
languages, e.g. the use of writing, kind of religion, economic 
structure, etc., but it seems wise not to use the term ‘dialect’ 
in this connection. Also, it may well turn out that languages 
can be rated on certain purely linguistic scales, such as 
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regularity of morphophonemics, ability to form compounds, or 
to transfer morphs from one word class to another, etc., but 
such classification has not yet been systematically carried 

out and in any case will undoubtedly fail to provide a linear 
scale of absolute rating for languages as wholes. Cf. R. S. 
Wells, Archiving and language typology, IJAL 20. 101-7 (1954). 


13. For discussions of the rise of standard varieties in 
the languages of Europe, see J. Vendryes, Language a lin- 
uistic introduction to history, tr. P. Radin 260-79 (London, 
1931); A. Meillet, ‘Les langues dans l’Europe nouvelle, Chs. 
VIII-XIV (2nd ed. Paris, 1928). H. Kloss, Entwicklung at- 
tempts an analysis of the process of standardization. 











14. Other definitions of dialect are, of course, used for 
special purposes. For example, Bloch in the Postulates defines 
a dialect for the purpose of that study as a set of idiolects shar- 
ing the same phonology. 


15. E. Sapir, Language 157-182 (New York, 1921). 


16. It must be noted that different kinds of communication 
very probably have different strengths under this assumption 
and that this whole area constitutes an important field for 
sociolinguistic investigation. 


17. Cf. L. Bloomfield, Language Chs 3, 19 (New York, 
1933). 


18. In noting that Grierson did little or no dialect atlas 
work, we have no intention of belittling his tremendous achieve- 
ment. The LSI remains one of the world’s major productions 
of linguistic scholarship, and Grierson himself was fully aware 
of the necessary limitations of his project. 


-19. Important original work in South Asian dialectology 
has, however, been done in the detailed historical studies of 
particular languages or regional dialects. For example, S. K. 
Chatterji, Origin and development of the Ben aii language 
(Calcutta, 192 D. Varma, La langue Bra (Pavis, 5); B. 
Saksena, catation of Awadst Cileiebed 38); S. M. Katre, 
Formation of Konkani (Bombay, 1942). 


20. E. R. Leach, Political systems of highland Burma 
39 (London, 1954). Cf. also the following statement by Martin 
Joos, The medieval sibilants, Language 28. 222-31 (1952) 
quoted in John L. Fischer, Social influences on the choice of a 
linguistic variant, Word 14. 47-61: “the dialects and idiolects 
of higher prestige were more advanced in this direction and 
their speakers carried the drift further along so as to main- 
tain the prestige-marking difference against their pursuers.” 














21. Cf. also Fischer, op. cit., p. 52, footnote 7. Fischer 
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states: “Joos seems to me to be unique in his recognition that 
the two processes (i.e. prestige imitation and the use of lin- 
guistic features to emphasize exclusiveness) combine to con- 
stitute a self perpetuating cycle.” 


22. Two examples: R. McDavid, Post-vocalic /-r/ in 
South Carolina, a social analysis, American Speech 23. 194-203 
(1948); S. Sapon, A methodology for the study of socio-economic 
differentials in linguistic phenomena, SIL 11. 57-68 (1953). 


23. Two excellent older studies exist: J. Bloch, Castes 





texts in two dialects, Bulletin of the School of Oriental Studies, 
6. 897-931 (1932). One recent study is explicitly concerned 
with this problem: J. J. Gumperz, Dialect differences and 
social stratification in a North Indian village, American An- 
thropologist 60. 668-82 (1958). eer 


24. See G. N. Putnam and E. M. O’Hearn, The status 
significance of an isolated urban dialect, Language 31, Supple- 
ment, Language Dissertation No. 53 (1955) and the bibliography 
given there. 








25. A recent issue of the Indian anthropological journal 
Man in India was devoted almost entirely to this problem: Man 
in India 39. 2 (1959). 


26. Cf. Bloch, op. cit., Aiyar, op. cit., Gumperz, op. 
cit. (1958), P. L. Garvin and S. H. Riesenberg, Respect be- 
havior on Ponape, American Anthropologist 54.201-220. Most 
linguistic studies of style have been concerned with literary 
style, but increasing attention is being paid to the broader as- 
pects of style differences in language. For an interesting 
modern discussion see John L. Fischer, op. cit. Cf.. also the 
symposium on Casual and Non Casual Language held at the 
American Anthropological Association annual meeting, Chicago, 
1957, papers from which were published in International Jour- 
nal of American Linguistics 24. 253-272; and see now especial- 
ly T. A. Sebeok, ed., Aspects of Style in Language (New York, 
1960). A more traditional but interesting approach to the sub- 
ject is found in: P. L. Garvin ed., A Prague school reader on 
esthetics, literary structure and style (Washington, 1955). 




















27. For example, cf. C. A. Ferguson, Arabic baby talk, 
For Roman Jakobson 121-8 (The Hague, 1956). 


28. For this layer phenomenon in German, cf. W. Schon- 
berger, Die sprachliche Verhaltnisse der Tirol-Salzburg- 
Bayerischen Landerecke, Teuthonista 10.62 (1934); H. Moser, 
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Mundart und Hochsprache im neuzeitlichen Deutsch, Der 
Deutschunterricht 8. 36-61 (1956). 


29. J. J. Gumperz, Language problems in the rural de- 
velopment of North India, Journal of Asian Studies 16. 251-9 
(1957). 

30. Cf. C. A. Ferguson, Diglossia, Word 15.2 (1959), 
where this language situation is described in more detail. 
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Conclusions 


0. Dialect differences in the languages of India may be 
thought of in a three-dimensional framework: in addition to 
the horizontal distribution of geographical dialects, as is found 
throughout the world, India offers exceptionally clear cases of 
dialects which are spoken in a single spot, but which may be 
arranged in a vertical scale correlated with social class. 
These are the caste dialects, such as those of a North Indian 
village which have recently been described by Gumperz.! In 
the study of such dialects, the following question may be raised: 
Is there a correlation between the amount of linguistic change 
manifested in a dialect and the social status of the people who 
speak it? In other terms, in what caste dialects are the more 
archaic features to be found? Investigation of this question 
may require separate consideration of various parts of the lan- 
guage: that is, phonemic change, grammatical change, and 
lexical change may not all operate at the same speed in a given 
dialect. 

This paper does not attempt to answer the question com- 
pletely, but merely to contribute toward an answer, primarily 
by comparing data recorded from two Kannada dialects of 
Bangalore District, Mysore State, South India. One isa 
Brahmin dialect, as spoken by a young woman born and raised 
in the city of Bangalore. The other is of the agricultural 
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Okkaliga community, as spoken by a young man from a village 
a few miles outside the city. Both informants have college 
educations.* The transcriptions are in terms of the following 
phonemes: /pttckbddjgfis Szmnnyjvilyrhiieé 
ziaauwuodos3/.? No attempt is made here to describe all 
existing differences between the two dialects in question; the 
differences which are described are those best attested in the 
data. 


1. The phonemic systems of the two dialects display the 
following differences: 

1) The Brahmin dialect (hereafter abbreviated as B.) has 
the phonemes /f z 3/ in loanwords from English; in the Okkaliga 
dialect (O.) these are replaced by /pj 4/ respectively. E.g.: 
B. kofi, O. kapi ‘coffee’; B. dazan, O. dajan ‘dozen.’ 

2) B. /c/, when intervocalic, sometimes corresponds to 
O. /s/: B. samacara, O. samasara ‘news’, from Skt. 
samacara-. The other cases noted are also loanwords. The 
data suggests that O. lacks contrast between /c/ and /s/, at 
least in medial position. This is reminiscent of the situation 
in Tamil, where affricate and fricative are members of one 
phoneme;* the situation was probably the same in Primitive 
Dravidian.> — 

3) B. /8/, occurring mainly in loanwords from Sanskrit, 
often corresponds to O. /s/: B. Santi, O. santi‘peace’; B. 
déSa, O. désa, ‘country’; B. kaSta, O. kasta ‘difficulty.’ (The 
B. forms are identical with the Sanskrit stems in each case. ) 
The lack of contrast between /s/ and /3/ is also a character- 
istic of Tamil, and presumably of Primitive Dravidian. 

4) B. /h/ usually corresponds to O. zero: B. haku, O. 
aku ‘put’. After pause, B. /he hé ho hd/ correspond to O. /ye 
yé va vo/ respectively. This is in conformity with morpho- 
phonemic changes also operative in B., by which onglides are 
added to mid vowels after pause, and /vo/ is further replaced 
by /va/. E.g.: B. hesru, O. yesru‘name’; B. hoge, O. vage 
‘smoke’; B. hogu, O. vogu ‘go’. In all these cases, the B. 
forms represent the earlier stage of development, reflecting 
PDr. forms with initial *p: cf. OKa. pesar, Ta. peyar, Te. 
péru ‘name’; OKa. poge, Ta. pukai, Te. poge ‘smoke’; OKa. 
pogu, Ta. poku, Te. pdvu ‘go’. 

5) Aspirated stops of B. correspond to unaspirated ones 











/ 
a 
b 
Vv 
B 
Cc 











eee 





21 


in O. The examples are principally loanwords: B. bhimi, O. 
bimi ‘land’ (Skt. bhimi-). 

6) Syllable-initial consonant clusters, occurring in loan- 
words in B., are rare inO. Thus sequences of sibilant plus 
oral sonorant in B. correspond to /s/ alone inO.: B. svami, 
O. sami ‘sir’ (Skt. svamin-); B. man3ya, O. mansa ‘man’ 
(Skt. manuSya-). B. /s/ plus nasal sometimes corresponds to 


O. /si/ plus nasal: B. snéyta, O. sinéyta ‘friend’ (Skt. snehita-). 


B. /k8/ corresponds toO. /c/: B. k3emsu, O. cemsu ‘excuse 
me’ (from Skt. Vk3am ‘to pardon’. ) 

7) B. /&é/ often corresponds to O. /ya/ when a mid or low 
vowel occurs in the next syllable: B. péte, O. pyate ‘town’. In 
these cases the pronunciation with /é/ is older: cf. Ta. péttai, 
Marathi peth. 

8) B. /o/ often corresponds to O. /a/, as in the second 
morpheme of B. mal-koll-dna, O. mal-kal-ani ‘let's go to bed’. 
This element, having reflexive meaning in Ka., is cognate with 
Ta. kol ‘receive’, Te. konu ‘take’, Cf. also B. barokke, O. 
barakke ‘for coming’; B. agolla, O. agalla ‘doesn't become’. 
Both of these are contracted from forms such as the literary 
equivalents bar-uvu-da-kke and ag-uvu-d-illa respectively. 

9) B. /a/ in the sequence /ri/, occurring in loans from 
Sanskrit, usually corresponds to O. /u/: B. sringara, O. 
sujgara ‘beauty’ (Skt. $rngara-. ) 

The foregoing correspondences may be summarized as 
follows: Insofar as it fails to accept new phonemic contrasts 
occurring in loanwords, the O. dialect is the more archaic of 
the two. It is innovating, however, in its loss of earlier /h/, 
and in the changes of /@/ to /ya/ and /o/ to /a/. 


2. Grammatical differences between the two dialects in- 
clude the following: 

1) The dative suffix of B. has the allomorphs -kke after 
/a/ and -Age elsewhere. (Aisa morphophoneme commonly 
actualized as zero after consonants; a vowel plus A, however, 
becomes immune to the syncope which otherwise affects short 
vowels.) This corresponds to O. -ke after nouns of all types: 
B. tta-kke, O. iit-ke (with syncope) ‘to dinner’; B. madve-ge, 
O. maduv-ke ‘to a wedding’; B. market-ge, O. market-ke ‘to 
the market’. O. also has the variant -kya: B. péte-ge, O. 
pyat-kya ‘to town’. Historically, B. -kke and -Age may be 
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traced to OKa. -kke and -ige respectively. OKa. also has the 
form -ke, however;® this has been lost in B., but generalized 
in O. to the point of replacing other allomorphs. 

2) Locative formations in B. contain an element -alli, 
corresponding to O. -agi: B. bandy-alli (also contracted to 
bandili), O. bandy-agi‘in a cart’. The B. form may be identi- 
fied with the adverb alli ‘there’; the O. form is of uncertain 
origin, though homonymous with agi ‘having become’, from agu 
‘become’. Formations in -alli are attested in OKa., along with 
several other types lacking in the modern dialects;’ those in 
-agi are apparently innovations. 

3) The first person plural imperative suffix is B. -6na, 
O. -ani, -ani, -ana. E.g.: B. hog-6ona, O. vog-ani ‘let’s go’; 
B. mugs-6na, O. mugs-ani ‘let's finish’; B. kit-koll-6na, O. 
kutit-koll-ana ‘let’s sit down’. Cf. OKa. -uvam, MKa. -uva, 
modern written Ka. -uva, -uvana, -uvana, -dna. ® If, as is 
claimed by Kittel, /n/ is earlier than /n/ in this suffix, then 
the O. forms are archaic in retaining it. The vowels of the O. 
forms, however, appear to be innovations. 

4) Certain verb stems show different allomorphy in the 
two dialects: agu ‘become’, but B. ag-ta O. a-ta ‘becoming’; 
B. hdgu, O. vdgu ‘go’, but B. hég-ta, O. vo-ta ‘going’; ba 
‘come’, but B. bar-ta, O. bat-ta ‘coming’. The O. forms are 
unattested in older Kannada. A different case, however, is B. 
ki-tu, O. kun-tu ‘having sat’ (from a stem occurring only in 
these forms); both of these date from the medieval period, 
being attested from 1398 and 1585 respectively. ? 

5) The verb iru ‘be’ has a set of forms in B. which lack 
a tense suffix; corresponding to the third person forms of these, 
O. uses pronouns plus the emphatic morph-e. E.g.: B. 
iddane, O. avn-e ‘he is’ (avnu ‘ he’); B. iddare, O. avr-e ‘ they 
are’ (avru ‘ they’); B. ide, O. ad-e ‘it is’ (adu ‘it’). All of 
these are modern formations: of the O. forms, however, only 
the neuter ad-e is found in the modern written language. !° 
The other O. forms may represent an analogical extension on 
this model. 

The foregoing grammatical correspondence thus show the 
B. dialect to be generally more archaic. 


3. Finally, the following cases typify the differences of 
vocabulary between the two dialects (OKa. and other Dravidian 
forms are from Kittel's dictionary): 
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1) In some cases, where there is only a slight phonemic 
difference between B. and O. forms, the B. form is known to 
be the earlier one. e.g.: 

B. hordu, O. voldu * start out’; cf. OKa. pora, pera ‘out’, 
Ta. pir ‘outer’, Malayalam pur ‘outside’, Te. per ‘foreign’. 

B. baglu, O. baklu ‘door’; cf. OKa. bagil. 

B. bombe, O. gombe ‘doll’; cf. OKa. bombe, Ta. pommai, 
Te. bomme. 

B. matte, O. mante ‘again’; cf. OKa. matte, Ta. marrai. 

2) In other cases, the O. form is known to be earlier; _ 
there is not necessarily any similarity between the two forms. 
EB. g.: 

B. prayana, O. payna‘journey’. The B. form is a recent 
copy of Skt.; the O. form is from OKa. payana, which in turn 
is from Skt. prayana- or prayana-. 

B. bahla, O. balu*much, very’. Cf. MKa. bahala, from 
Skt. bahala; but OKa. bal ‘strength, greatness’, Ta. val, Te. 
balu. 
togo, O. teko ‘take’; cf. OKa. tekkoll-. 
hége, O. yenge ‘how’; cf. MKa. hége, but OKa. henge. 
huli, O. yesru Decl Me cf. OKa. esar. 
uta madu, . unnu ‘eat a meal’; cf. OKa. un, big un. 

3) In the remaining cases, the relative age of the forms 
is not clear. E.g.: 

B. sumne, O. sumke ‘quiet’; cf. OKa. summane and 
summage, both attested in the same work. 

B. yeStu, O. yésu ‘how much’, both attested in the same 
MKa. work. 

B. sarti, O. sdri‘time, turn’. Both are attested only in 
modern times. The B. form may be from Skt. sarat ‘going’; 
with the O. form, cf. Malayalam sari. 

B. solpa, O. rav-astu ‘a little’, both modern. The B. 
form is from Skt. svalpa; the O. form is literally ‘as much as 
(astu) a grain (rave)’, cf. Te. rav-aste. 

B. snana madu, O. nir ak-k6d ‘bathe’, both modern. B. is 
literally ‘to make a bath’, with Skt. snana- ‘bath’; O. is liter- 
ally ‘to put water on oneself’: 

B. sinima, O. baysk6opu ‘movies’, from English cinema 
and bioscope respectively; the former may be the more recent 
borrowing. 


al 
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4. Generalizing from the above comparisons, we may 
draw the following conclusions: O. accepts foreign words and 
foreign phonemic patterns less readily than does B. Its vocab- 
ulary is thus more archaic, and its phonemic system is like- 
wise archaic in that it lacks certain contrasts which B. has 
taken over from Sanskrit and other languages. B., on the 
other hand, is more resistant to phonemic changes of native 
sounds, and to grammatical change. 

Reasons for B.’s receptiveness to importations from other 
languages are not hard to find: the Brahmins have traditionally 
been conversant with Sanskrit, and more recently with English. 
The prestige attached to both these languages has increased 
the likelihood for bilingual speakers to introduce words from 
them into Kannada. It is not quite as easy to account for the 
conservative tendency of B. as regards phonemic and gram- 
matical change. A conditioning factor may be the greater 
literacy of the Brahmin community; that is, the permanence of 
the written word may retard change in the spoken language. 

In this connection, data from Tulu, another Dravidian lan- 
guage of South India, deserves examination. Although separate 
dialects of Tulu are spoken by Brahmins and by non-Brahmins 
or Shudras, neither dialect is commonly written. We may 
therefore hypothesize that the B. dialect of Tulu will show no 
greater retardation of phonetic change than the S. (Shudra) 
dialect. The Tulu data available!’ shows, first of all, some 
correspondences in which the B. forms are more archaic: 

B. initial vowel corresponds to S. zero in a number of 
cases where PDr. has the initial vowel: B. imbolu, S. modlu 
‘this woman’ (Ta. ival, Ka. ivalu); B. edangur, S. dangui ‘left 
(hand)’ (Ta. itai, Ka. yeda): B. unpu, S. nuppu ‘food’ (cf. Ta. 
un, Ka. unnu ‘to eat’. ) ; 

B. /s/ corresponds to S. /t/ in some cases where PDr. 
has *c: B. sik-, S. tik- ‘be obtained’ (Ta. cikku, Ka. sikku); 
in retaining sibilant quality, B. here appears more archaic. 

B. /j/ corresponds to S. /d/ where PDr. has *f!? in B. 
jefiji, S. defiji‘crab’ (Ta. fiantu, fientu, Ka. yédi). B. appears 
more archaic, in that it retains palatal quality. 

There are, however, other correspondences in which S. 
has the more archaic forms: 
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B. initial /é/ corresponds to S. /ya/ in some cases where 
PDr. has *ia:!3 B. énw, S. yanur ‘I’ (Ta. yan, OKa. 4n); B. 
érur, S. yarur ‘who’ (Ta. yar, ar, Ka. yaru.) 

B. /s/ corresponds to S. /t/, where PDr. has *t, in B. 
sdj-, S. tdj- ‘appear’ (Ta. ténru, Ka. tdru. )!* 

In another correspondence, both dialects innovate, and 
neither can be considered the more archaic: B. /1/ corresponds 
to S. /r/ where PDr. has *r:!5 B. bale, S. baree ‘plantain’ 
(Ta. varai, OKa. bare, modern bale); B. k6li, S. kori ‘hen’ 
(Ta. kori, OKa. kori, modern kéli); B. bal-, S. bir- ‘fall’ (Ta. 
vir-, OKa. bir-, modern bilu. ) 

On the basis of the limited data, we may therefore con- 
clude that the difference between B. and S. dialects of Tulu, as 
regards rate of phonetic change, is less marked than that be- 
tween the B. and O. dialects of Kannada. A further study of 
Tulu should be especially rewarding in the investigation of 
Indian caste dialects. 


Notes 


1. John J. Gumperz, Dialect differences and social 
stratification in a North Indian village, American Anthropolo- 
gist 60. 668-82. (1958) 


2. The Brahmin idiolect presented here is that of my 
chief informant in an intensive study of the Bangalore Brahmin 
speech, and may be taken as typical. The Okkaliga dialect, 
on the other hand, was studied only briefly. The data obtained 
is not ‘pure’ Okkaliga speech; that is, the informant’s idiolect 
has been influenced by the prestige of the Brahmin dialect. 
Even so, the salient features of Okkaliga speech show up 
clearly enough in the data recorded from him. 


3. The phonemic system of Kannada is described in de- 
tail in my monograph, An outline of colloquial Kannada (Poona, 
1958.) 


4. Murray Fowler, The segmental phonemes of Sanskrit- 
ized Tamil, Language 30. 360-67 (1954). 


5. T. Burrow, Dravidian Studies VI, Bulletin of the School 
of Oriental and African Studies (University of London) 12. 132- 
47 (1947). Primitive Dravidian is hereafter abbreviated PDr.; 
so also Ka. for Kannada, OKa. for Old Kannada, MKa. for 
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Medieval Kannada, Ta. for Tamil, Te. for Telugu, Skt. for 
Sanskrit. 


6. A. N. Narasimhia, A grammar of the oldest Kanarese 
inscriptions 141-46 (Mysore, 1941). 


7. Narasimhia, Grammar, 151-54. 
8. F. Kittel, A grammar of of the Kannada language 148-51 





(Mangalore, 1903) “The symbol n m 1 stands for the Kannada sonne, 


pronounced (in this position) as /m/ in modern Kannada. 


9. F. Kittel, Kannada-English Dictionary 441, 457 
(Mangalore, 1894). 


10. Kittel, Grammar, 133-34. 


1l. From L. V. Ramaswami Aiyar, Materials for a sketch 
of Tulu phonology, Indian linguistics 6. 385-439. The author's 
phonetic symbols are altered to agree with the transcription 
used above for Kannada. 


12. According to T. Burrow, Dravidian Studies V, BSOAS 
11. 595-616 (1946), esp. 605. 


13. Burrow, Dravidian Studies V, 599. 


14. The complicated history of Tulu initial /t c s dj/ is 
dealt with by Ramaswami Aiyar, Tulu initial affricates and 
sibilants, Quarterly journal of the Mythic Society (Bangalore) 
22. 259-73 (1932). However, the author does not clearly dis- 
tinguish the various Tulu dialects represented in his material. 
Thus, the stem meaning ‘to be entangled’ is given as cikk-, 
tikk-, sikk-, and sikk-, only the first being distinguished as 
‘rare sub-dialectal.’ 











15. Information on the history of *r, and on its modern 
pronunciation, is given in Bh. Krishnamurti, Proto- Dravidian 
*z, Indian linguistics 19. 259-93 (= Turner jubilee volume I, 








1958.) 
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TAMIL ——- LITERARY AND COLLOQUIAL! 


M. Shanmugam Pillai 
Annamalai University 


1. Introductory 
Phonological differences 


Morphological differences 


A Ww ON 


Conclusion 


1.0 This is a comparative study of the differences in pho- 
nology and in noun inflection between two distinct styles of 
Tamil, which the present author uses on different social occa- 
sions. The literary Tamil (LT) is the one he employs in class- 
room lectures, in public meetings, and on certain other formal 
occasions. The vocabulary of this latter style corresponds to 
his writings in journals. The pronunciation is similar to that 
of the All India Radio news broadcasts in Tamil and to that 
ordinarily employed in reading a written text. The colloquial 
Tamil (CT) is the one he uses in ordinary conversation, when 
talking to his family and friends and also to students outside 
the classroom. It is similar to the style employed by other ed- 
ucated native Tamils throughout the present Madras State, but 
differs from the speech of the uneducated. The dialects used 
by the latter show a number of regional and caste variations, 
which have not as yet been systematically studied. * 


1.1 The LT demands a few more clarifications. Prose 
writings of the last century and even of the early years of this 
century were replete with Sanskrit words. During the early | 
decades of this century the purist movement, started by 
Maraimalai Adigal, which advocated the removal of all foreign 
words from Tamil, left a permanent impact on modern Tamil 
prose. Very few writers were left untouched by this impact. 
Modern Tamil prose including that of the present author, though 
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not one hundred per cent ‘pure’, is purer than what it was a few 
decades ago. These changes however were confined to the 
lexicon and have had little effect on the structure of the language 
as such. The LT described here also differs from the rhetori- 
cal style used in the speech making of certain political organi- 
zations which is characterized by long sentences, complicated 
syntax and certain characteristic intonation patterns. It is 
further distinct from the so-called Pandit style replete with 
archaic technical terms and grammatical features. This latter 
style is occasionally used by the present author while discussing 
ancient grammatical treatises but not in talking about more gen- 
eral subjects and writing for the general public. 


2. LT and CT have the same inventory of phonemes. * 
Consonants 


labial dental alveolar retroflex palatal velar 


Stops p t t t é k 
b d d J e 
Spirants f = ¢ 


Nasals m 


Ip 
3 
3 

Ee} 

3 


Laterals 1 1 
x 
i 
Semi-con- v y 
sonants 
Vowels 


i e or a ° u 
aor. les aie Hes. ae 


The phonetic correlates* of the above symbols are similar for 
the two styles, with the exception of the word final allophone of 
/n/, which is [n] in LT and [~] in CT. 

Although the phonemic inventory is the same, the two 
styles show a number of differences in the distribution of the 
individual phonemes. These are classified and listed in the 
section below. Some of the correspondences found are regular, 
i. e. they are predictable in terms of the LT phonemic 











e 
T 
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environment. Others are confined to limited lists of items. 

The historical relationship between the LT and CT forms has 
not yet been investigated in detail. It was found convenient, 
however, to give lists of correspondences arranged as they 
might be arranged in a comparative historical study, with the 
LT forms taken as primary.* Process terminology is occa- 
sionally used, although this does not necessarily have historical 
implications. 


2.1 Final consonant correspondences. 

CT has no final consonants with the exception of final /n/, 
(phonically [~]) and of the final /m/ in /o:m/ ‘name of the deity’ 
(a Sanskrit loanword used in prayer). The various correspond- 
ences of final LT consonantal themes are listed below. ° 


(1) CV:y regularly corresponds to CV 


pa:y pa: mattress 
te:y te: rub 

ka:y ka: nut 

na:y na: dog 

no:y no: disease 
vary va: mouth 


(2) CVy regularly corresponds to CVyyi 


piy Ppiyyi tear 

ney neyyi ghee 

tey teyyi do 

kay kayyi hand 

vay vayyi put 

poy poyyi lie (untruth) 
(3) Final -ay regularly corresponds to -e 

pillay pille child 

tontay tonte throat 

éattay Eatte shirt 

kattay katte block of wood 

katay kate shop 

vatay vate an edible 

malay male mountain 


natay nate step 
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(4) 


(5) 


(6) 


(7) 


(8) 


Monosyllabics of the structure (C)VC,, where Cc, 
either /n/, /n/, /1/, or /1/, venuaely ‘cadibaasai to 
(C)VC,C,u. 


pen pennu pen 
pen pennu woman 
kan kannu eye 

un unnu eat 

pal pallu tooth 
vil villu bow 

kal kallu toddy 
el ellu sesame 


Final -m corresponds to final -n in some polysyllabic 
words. 


kotam koton pot, vessel 
patam paton picture 
ti:rkkam ti:rkkon valor 


In the following items (C)Vm corresponds to (C)Vmmo. 


nam nammo our 
um ommo your Cf. (19) 


In one other item V:m corresponds to CV:mo. 
na:m ma:mo we 


In most other themes the final -C, corresponds to -Cyu 
For exceptions, cf. 3.2, 3.4. 


pa:l pa:lu milk 

to:1 to:lu skin 

a:l a:lu person 
ma:n ma:nu deer 

anil anilu squirrel 
katal katalu sea 

pensil pensilu pencil 

u:l u:lu fate 

uyir uyiru life 

tayir tayiru yogurt 

avar avaru he (remote) 
ivar ivaru he (proximate) 


katavul katavlu god 
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2.2 Non-final consonant correspondences. 
(9) CT has no intervocalic -tt-. LT medial -tt- regularly 
corresponds to -tt. 


a:ttil a:ttle’ in the river 

ka:ttil ka:ttle in the air 

u:ttu u:ttu spring, fountain 
ma:ttu ma:ttu change 

e:ttu e:ttu drive 

uttu uttu (looking with) keenness 
vitta:n vitta:n he sold 

vatte vatte to dry 


(10) Medial -tC)- corresponds to -C,C}-. 


kati kaéi party, association 

patci pacééi bird 

ke:tke ke:kke to hear 

utka: rrnta:n okka:nta:n he sat Cf. also (15, 19) 
(11) Medial -kk- after LT -y- or -i- regularly corresponds 

to -kky-. 

asaykke asekkye to shake 

atikke atikkye to beat 

patikke patikkye to learn 

pataykke patekkye to offer to gods 


(12) Intervocalic nnV, 11V, 1IV, and vvV regularly correspond 
ton, 1, 1, and v before consonants. 


kannil kanle in the eye (locative) 
kolluva:n kolva:n he will kill 
kolluva:n kolva:n he will have 
evvalavu evlavu how much 


(13) Medial -1- before nt regularly corresponds to zero. 


avulntu avuntu untied 
kavulntu kavuntu toppled 


(14) Medial y after -a- in polysyllabic words regularly cor- 
responds to zero; the -a- appears as -e- Cf. also (3). 


ataykke atekkye to shut 
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(15) 


(16) 


(17) 


(18) 


to break 
he offered (to god) 


utaykke 
pataytta:n 


otekkye 
patetéa:n® 
Medial -rr- corresponds to zero before CC, or nasal 
plus homorganic consonant in the items below. 


utka:rrnta:n okka:nta:n he sat 
nirrkkirra:n nikkrra:n he is standing 
pa: rrkkirra:n pa:kkrra:n he is seeing 
pa:rrtta:n pa:tta:n he saw 


In a few other items medial -rr- corresponds to -l- be- 
fore geminates. 


parrppoti palppoti tooth powder 
karrttu:n kalttu:n stone pillar 
karrcatti kalééatti stone vessel 


C,VvCc, corresponds to c,C, in some items. 


kannil kanle in the eye Cf. also 3.4 

o:tina:n o:tna:n he ran 

ca:ttina:n éa:ttna:n he pointed to him (for 
something) 


Initial m- corresponds to v- in the following word. 


mi:say vi: se mustache 


2.3 Vowel correspondences. 


(19) 


/i/ and /u/ in the first syllable of polysyllabic words 
correspond to /e/ and /o/ respectively before -a- in the 
next syllable. The -a- before final LT -m becomes -o- 
before final -n; cf. (27). Elsewhere it appears as /e/. 





itam eton place 

inanku enepku agree 

tirrappu terreppu opening 

pirrappu perreppu birth 

pirantay perente a creeper 

ulakam olekon world 

urranku orrenku sleep 

kuranku korenku monkey 

pulayppu poleppu a living Cf. also (14). 
vilay vele price Cf. also (3). 














(20) 


(21) 


(22) 


(23) 


(24) 


(25) 
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kilay kele branch 
kulay kole killing 
kutay kote umbrella 


Initial LT i- corresponds to CT zero in the items below. 


impa mpa a call for cows 

inta nta expression used 
when giving some- 
thing to someone 

irantu rentu two Cf. also (19). 


In one other instance e: corresponds to ya: 
e:n ya:n why 


The above correspondences are not found in the word 
ivar ‘he’, and before geminate consonants. 


ivar ivaru he 

ivarukku ivarukku to him 
kupparre kupparre upside down 
kuttacci kuttaé¢i short girl 
kuttay kutte small pond 
tillay tille name of a place 
pillay pille child 

killay kille parrot 


Medial -i- is lost in CT after 1 or rr plus geminate tt. 
Cf. also (9). 


va:lttina:n va:lttna:n he praised 

ta:lttina:n ta:lttna:n he lowered 

nakarrttina:n nakattna:n he moved (trans.) 
nakarrttna:n 

patarrttina:n patattna:n he spread (trans.) 


patarrtna:n 
Medial u is lost in these two items. 


tamilukku tamilkku to or for Tamil 
katavul katavlu god 


C,iC2- correspond to C)C.,- in the following words. 


pila:vu pla:vu jack tree 
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mila mila:vu an animal 
kita:vu kta:vu buffalo 
kilappu klappu hotel 


(26) In many polysyllabic words the vowel in the second syl- 
lable corresponds to zero. 


pa:kkiyam pa:kkyon personal name 
o:tina:n o:tna:n he ran 
pa:tina:n pa:tna:n he sang 
evvalavu evlavu how much 
kolluva:n kolva:n he will have 
kolluva:n kolva:n he will kill 
kannil kanle in the eye 

ma: ppillay ma: pple son-in-law 
kattina:n kattna:n he cried 


(27) /-a-/ before final -m ordinarily corresponds to /o/ be- 
fore final -n. This vowel change is connected with final 
consonant correspondence (5). 


kotam koton pot 
patam paton picture 
ti:rkkam ti: rkkon valor 


(28) Final -an corresponds to -en (phonetically [@]) in the 
items below. 


avan aven he 
éantiran éantren moon 
ivan iven he 
makan maken son 


(29) -an corresponds to -on in the following single item. 
katan katon debt 


(30) Final -u corresponds to final -e in these two items. 


inku inke here 
enku enke where 


2.4 Some correspondences involve sequences of consonants 
and vowels. Correspondences (3), (14) and (5), (27) might have 
been listed in this category. Others are: 





a 





ee ee 











t 
] 
] 
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(31) Medial ak corresponds to -o-. 


marumakan marumoen son-in-law 
marumakal marumoa daughter-in-law 


(32) In the following words final -lutu corresponds to zero or 
final n ( [~] phonetically). 


ippolutu ippo~ippon now 
appolutu appo”~ appon then 
eppolutu eppo~ eppon when 
(33) In two other instances CT has final -vu where LT has 
zero. 
mila: mlavu: an animal 
kita: kta: vu buffalo 


3. Tamil nouns are inflected for gender, number and case 
in that order. The arrangement of the suffixes in an inflected 
noun may be represented by A B C DE, where the symbols have 
the following values: 


A stem 

B_ stem formative if any 

C gender suffixes if any, neuter is unmarked 
D number, singular is unmarked 

E case suffixes if any 


Morphemes B-E are all suffixal morphemes occurring in 
the same relative order in LT andin CT. The two systems 
show differences, however, both in the shape of individual mor- 
phemes and in morpheme distribution. Some of these are il- 
lustrated in the section below. 


3.1 Stem formatives.? Certain classes of nouns have 
morphs between the stems and the gender suffixes to which no 
meaning can be ascribed. These we call stem formatives; in 
Hockett’s terminology they are empty morphs.!° LT has -tt-, 
-€&é-, and -tt- as stem formatives. All of them occur in CT. 
However, the distribution in CT differs from that in LT. 

In LT, nouns in the neuter class which end in -am have the 
stem formative -tt- in the singular before case suffixes and this 
stem formative does not occur in the plural with or without case 
suffixes. But in CT this stem formative is found both in the 
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singular and in plural before case suffixes. In the plural with 
case suffixes the alternant form without the stem formative is 
also used. When there is no case suffix there is no stem form- 
ative either in the singular or in the plural. 








Stem 8g. with pl. without pl. with 
case suff."! case suff. case suff. 
LT maram marattay marankal maralkalay tree 
CT maron maratte maranko marattuole 
= or 
marankale 
LT patam patatty patankal patankalay picture 
CT paton patatte patanko patattuole 
or 
patankale 


The other stem formatives -té- and -tt- have similar dis- 
tributions in LT and CT. 


3.2 Gender. Neuter is unmarked bothin LT and CT. The 
masculine gender suffixes in LT are an, a:n, and # (zero). Ex- 
cept for the phonologically conditioned morphophonemic altera- 
tions there is no difference whether they occur finally or non- 
finally.'* In CT the suffix -an has the allomorph -en in final 
position, -an- in non-final position. Cf. correspondence (28). 
Thus, LT forms to:lan, to:lanay ‘friend (male)’ correspond to 
CT forms to:len to:lane. _ 

The same type of difference as in the masculine is found in 
the distribution of the feminine suffixes. The feminine suffixes 
in LT are al, a:l, i, and §, and they are the same whether they 
occur finally or non-finally. Allomorphs of the suffixes al and 
a:l in CT are conditioned by position: the form in final position 
drops the -1. Thus, LT forms makal, makalay ‘daughter’ cor- 
respond to CT maka, makale; and LT akka:1, akka:lay ‘elder 
sister’ correspond to CT akka:, akka:le. 


3.3 Number. Singular is unmarked both in LT and CT. 
The noun plurals in LT may be classified into four classes ac- 
cording to the suffixes they have. The plural suffixes in LT 
are kal, kkal, ar ~arkal, and a:r ~a:rkal.'* There is no dif- 
ference whether these suffixes occur finally or non-finally. 








Ie 


II 


II 


II 


II 





Lre 
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In CT the nouns fall under eight different classifications. 
The plural suffixes are different whether they occur finally or 
non-finally. The two systems are tabulated below illustrating 


the difference. 


Numbers I, II, Ill, and IV in the following 


classification indicate the classification of the nouns in LT and 
the subdivisions a, b, c within the numbers indicate the classi- 


fication in CT of that particular class in LT. 


la LT 
CT 
ih iT 
cT 
Ic-9 df 
CT 
Ila LT 
CT 
Illa LT 
CT 
IIIb LT 
CT 
ite? LT 
CT 


Final 
-kal 
marankal 
-ko 
maranko 


-kal 
-ko 
makko 


-kal 
ma:navikal 
-yo 
ma:noviyo 
-kkal 
pu:kkal 
-kko : 
pu:kko 


-ar ~ -arkal 
avar ~ avarkal 
alko ~ anke 
avanko ~ avanke 


-ar ~ arkal 

avar ~ avarkal 
-uo 

avanuo ~ avaluo!* 


-ar ~ -arkal 
ma:navar 
ma:navarkal 
-uo ~ ko ~ ke 
ma:navanuo 


Non-final 


-kal- 
maralkalay 
uol ~kal ~kl 


tree 


marrattuole ~ marankale ~ marankle 


-kal- 

makkalay 

kal ~ kl 

makkale ~ makkle 


-kal 

ma:navikalay 

yol 

ma:naviyole 

-kkal 

pu:kkalay - 

-ol- ~ -kkal- ~ -kkl- 


pu:ole ~ pu:kkale ~ pu:kkle 


-ar- ~ -arkal- 
avar ~ avarkal 
-kal- ~ -kl- 
avalkale ~ avankle 


-ar- ~ -arkal- 
avar ~ avarkal 
-uol- 

avanuole ~ avaluole 


-ar ~ -arkal- 
ma:navaray 
ma:navarkalay 
uol ~ kal ~ kl 
ma:navanuole 


child 


student (fem. ) 


flower 


they (no gender) 


they (masc.) 


they (fem.) 


student (male) 
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Final Non-final 
ma:navalko ma:navankale 
ma:navalke ma:navankle 
IVa LT -a:r ~ -a:rkal -a:r- ~ -a:rkal 
na:ta:r ~ na:ta:rkal na:ta:ray ~ na:ta:rkalay name of a 
caste 
CT -uo ~ ko ~ ke -uol- ~ -kal- ~ -kl- 
na:ta:nuo na:ta:nuole 
na:t&: Dko na:ta:jkale 
na:t&:jke na:ta: jkle 


Except for the free variations in III and IV the plural suf- 
fixes in LT are always kal or kkal. In CT, however, the non- 
final allomorphs of the plural suffixes always end in 1, and those 
of final position end in either -o or -e. 

The demonstrative pronouns avan ‘he’ (remote), ivan ‘he’ 
(proximate), the interrogative pronoun evan ‘who’, and the noun 
makan ‘son’, have three forms in LT: masculine sg., feminine 
sg., and plural without gender. The plural may refer to mas- 
culine plural, feminine plural, or both. In CT, however, they 
have five forms: masculine sg., feminine sg., plural without 
gender, masculine plural, and feminine plural. They are given 
below: 


LT 
masc. 8g. fem. ‘Sg. Pi. no gender 
avan aval avar ~ avarkal 
ivan ival ivar ~ ivarkal 
evan eval evar ~ evarkal 
makan makal makkal 

CT 
masc. 8g. fem. Sg. pl. no masc. pl. fem. pl. 

gender 
aven ava avajko avanuo avaluo 
iven iva ivajko ivanuo ivaluo 
even eva evanjko evanuo evaluo 


maken maka makko makanuo makaluo 
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3.4 Case morphemes. Differences in the case suffixes be- 
tween the two systems are confined to the shape of cognate mor- 
phemes. The correspondence in the accusative case suffix (-ay 
corresponding to -e) has been covered under the general phono- 
logical changes (3). The rest of the case suffixes, which do not 
come under the phonological correspondences are tabulated be- 
low in the two systems. 


LT CT 
Inst. -a:l -a:le 
maratta:1 maratta:le by the tree 
Conj. -o:tu ~o:te 
avano:tu avano:te with him 
Dat. -kku -kki after front vowels /i/ or /e/ 
in the preceding syllable; 
-kku elsewhere 
panaykku panekki to the panay tree 
manikku manikki to Mr. Mani 
avanukku avanukku to him 
Abl. -iliruntu -leruntu 
marattiliruntu marattleruntu from the tree 
Gen. -utaye Bes -o:te 
ennutaye enno:te my 
Loc. -il -le 
marattil marattle in the tree 
‘among"®  -il, -ukkulle -le 
marankalil marankalle 


marankalukkulle marattuolle 


Similar differences both in the shapes and in the distribu- 
tion of the morphemes are found in the verbal system also. 
teyta:y ‘you did’ and pirinta:y ‘you separated (intransitive)’ 
which have the same past tense suffix -t- come under a single 
class in LT. But in CT they come under two different classes 
since the first one has the past tense -t- whereas the second 
one has the past tense suffix -c-: teyte ‘you did’, pirifitée ‘you 
separated’. There is, in fact, no past tense -c- suffix in LT. 


4, Conclusion 
The above data, incomplete as they are, should be suffi- 
cient to illustrate the distinct nature of the two styles. From 
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the point of view of history they seem to represent two different 
stages of development of the language existing side by side. A 
complete explanation of the historical relationship between the 
two will require more comparative study and above all addition- 
al information on local dialects which have not been studied so 
far. The results of such a study, aside from being of interest 
to historical linguists, should also provide valuable informa- 
tion regarding population movements and social change. 

The existence of the two styles also has a number of more 
practical implications. Aside from the features illustrated here 
there are of course a number of additional differences in vocab- 
ulary and syntax. Acquisition of literacy in Tamil therefore 
requires much more than learning the alphabet. It is more like 
learning a second language. This fact has not been considered 
so far in the teaching of literacy and the language arts. Teach- 
ing materials which take account of the realities of spoken and 
literary styles could effect great improvements in the efficiency 
of present methods. 


Notes 


1. I would like to express my gratitude to The Rockefeller 
Foundation for a fellowship which enabled me to study linguis- 
tics at Cornell University, Ithaca (1956-57) and at the Univer- 
sity of California, Berkeley (1957-58) and particularly to Dr. 
John J. Gumperz, University of California, Berkeley, who sug- 
gested and helped in the preparation of this paper, and Dr. Mur- 
ray B. Emeneau, who read the first draft. 


2. Sir George A. Grierson’s Linguistic Survey of India, 
Calcutta, 1927, did not cover the Tamil speaking areas of 
Madras State and Travencore Cochin. Some features of Tamil 
caste dialects are described in Jules Bloch, Castes et dialectes 
en tamoul, Mémoires de la Société de Linguistique 16. 1-30 
(1909). In regard to the. uniformity ‘of speech among educated 
Tamils, note Bloch’s statement that special features of Brah- 
min speech are dropped in contacts with non-Brahmins (ibid. 


29). 


3. There are two recent publications dealing with the 
phonemics of Tamil; Murray Fowler, The segmental phonemes 
of Sanskritized Tamil, Language 30. 362-7 (1954), and Gordon 
H. Fairbanks, Frequency and phonemics, Indian Linguistics 








17. 105-13 (1957). Fowler’s analysis (from here on referred 
to as ST) seems to represent a more formal variety of Tamil, 
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closer to the Pandit style. His system consists of five unitary 
vowels /i e a o u/, occurring as simple vowels or geminates 
and 22 consonants /pbtdtdkgmnnllvcrxhwyrl/. 
Fairbanks’ system represents the CT as given here. In the 
footnotes to his paper Fairbanks has pointed out some of the 
differences between his and Fowler’s analysis. We will here 
list those differences not mentioned by Fairbanks. 

ST does not have /ze/. In LT and CT /z/ is found in words 
borrowed from English: /beenk/ ‘bank’, /beet/ ‘bat’. /h/ in ST 
corresponds to [x~y] in LT and CT, an allophone of /k/. 

The contrast between /x/ and /c/ in ST [k’axcid] ‘a meas- 
ure of weight’, and [k’att fi] ‘girdle’ is not found in CT for the 
simple reason that /kax¢u/ is a literary word, not now in use. 
In LT the above are [k’ahs#] and [{k’aéé’t] where [h] may be 
analyzed as an allophone of /k/ occurring before fricatives, 
the allophones of the respective stops and before retroflex flap 
[r] allophone of /t/. Other citations illustrating this allophone 
are [?ahd4] ‘that’, [? ahxi] ‘shrink’ etc. All these forms in- 
cluding [k’Ahsi] are found in ancient literature and not used in 
modern Tamil. 

The phonemes /f/ and /¢/ are found only in borrowed words, 
/f/ in words like /film/ ‘film’, /ka:fi/ ‘coffee’, /so:fa/ ‘sofa’, 
etc. all borrowed from English and /g¢/ only in the word /cinti/ 
‘Hindi’. In the speech of the uneducated /cinti/ is /inti/. 

The ST continuant /r/ is analyzed as the geminate of the 
flap /r/. 

ST has only three nasals /m n n/. Both in LT and CT six 
nasals have to be recognized, (or five if a different solution is 
adopted). All the six nasals occur initially when referring to 
the letters in the Tamil alphabetic system. 


LT CT 

makaram ma:na: the letter m 
nakaram na:na: the letter n 
nakaram na:na: the letter n 
nakaram na:na: the letter n 
fiakaram fla:na: the letter fi 
Nakaram Na:na: the letter n 


Children are taught to distinguish these sounds in school. Of 
these, /mnn/ contrast in many environments. /fi/ occurs 
only in the above example and in fia:yirru ‘sun, Sunday’, con- 
trasting with nya:yam ‘justice’; it might also be analyzed as 
/ni/, thus reducing the number of nasal phonemes to five. /n/ 
contrasts with n before /k/ e. g. ponkay ‘golden hand’ tankay 
‘younger sister’. nis found only in the name of the letter 
quoted above. 


4. Phonetic values of the above symbols are given in 
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Fairbanks (ibid). The Fairbanks analysis has seven nasal vow- 
els /@ @: &: 3 3: & &:/ occurring finally. These are analyzed 
as -Vnhere. This is possible since we set up the two styles as 
two distinct systems. However, if we were to present an over- 
all analysis of Tamil, we should have to set up the nasal vowels 
as separate phonemes. 


5. In all the statements and in the examples that follow the 
LT forms are given first and then the CT forms. 


6. The statements of phonological correspondences are 
numbered consecutively in parentheses (1) to (33) through the 
subsections 2.1 - 2.4. 


7. -il and -le are the LT and CT locative case suffixes 
(cf. 3.4). 


8. -tt- and -éé- are the LT and CT past tense suffixes. 


9. The term ‘stem formative’ is an extension of the term 
‘formative suffixes’ used by M. B. Emeneau, Kolami - A Dra- 
vidian Language (Berkeley, 1955). These suffixes could be 
analyzed as part of the stems or as part of the suffixes that fol- 
low. But the existence of classes of nouns without these suf- 
fixes suggests the segmentation of these morphs though no 
meaning can be ascribed to them. 





10. C. F. Hockett, Problems of morphemic analysis, Lan- 
guage 23. 321-43 (1947). 


11. The examples are given only with the accusative case 
suffixes. The situation is the same for all the cases. 


12. Before velar consonant /n/ has the morphophonemic 
alternant /p/. 


13. In old Tamil kal] is the plural suffix in neuter nouns and 
ar, a:r are the plural suffixes in non-neuter nouns. In LT both 
forms ar~arkal and a:r~a:rkal are used though traditional Pan- 
dits would still insist upon the use of only ar and a:r. 


14. The -an in avanuo is the masculine suffix; -al- in 
avaluo is feminine suffix; an in avanko ~ avanke should be 
historically the masculine suffix. But when the form avanuo 
was used for the masculine plural the form avanko ~ avanke lost 
its reference to masculine gender and is now used for the plu- 
ral without gender, either masculine, or feminine or oth. 
Hence anko ~ anke is taken to be the plural suffix. 


15. The word ‘among’ is not a case name. It gives the 
meaning of this case form, which occurs in the plural. 

















LITERARY AND COLLOQUIAL BENGALI IN 
MODERN BENGALI PROSE 


Edward C. Dimock 
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1. Introductory 
2. Linguistic differences 


3. History of the controversy 


1. It has long been recognized that there are at least two 
distinct forms of the Bengali language, the sadhu - bhasa 
(literary language, hereafter SB) and the calit - bhasga (the col- 
loquial language, hereafter CB). This condition of having two 
different forms of a language, with the distinction between them 
relating to prestige or other extra-linguistic values, is not an 
unfamiliar one. Bloomfield mentions it in his description of 
language community.! Historically and within our own culture, 
we need look no farther than to the relationship between Latin 
and the vernaculars in medieval Europe.* And, on a somewhat 
different level within our own contemporary society, there is 
the prestige-giving value of certain types of usages, pronuncia- 
tions, and “accents”, among certain groups of people. With 
this we are all familiar and I am sure that examples need not 
be multiplied. 

There is, however, one aspect of this general type of lin- 
guistic situation which might not be familiar to English-speaking 
people: that for a given language there might be one form used 
almost exclusively for literature and another form used almost 
exclusively on the conversational, social-intercourse level. 
Yet this is a feature of a large number of the languages of the 
world, including Bengali. A situation of this type in Arabic, 
even more radical than that in Bengali, has been recently de- 
scribed by Al-Toma.? He establishes certain categories for 
the definition of the distinction between two forms of Arabic, 
and these categories might be adopted by us here, in order to 
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open our discussion of the similar distinction in Bengali. 

In the first place, Al-Toma states that Classical Arabic is 
the language of formal communication, and that its usage is 
determined by “ situations which have the character of formal- 
ity.” His second generalization is that Classical Arabic is 
uniform throughout the Arabic-speaking world. And, thirdly, 
he says that Classical Arabic has more prestige than does the 
colloquial. The Bengali SB also has these characteristics; but 
to make the statement entirely accurate for Bengali, a few 
modifications would have to be made. In the first place, it must 
be understood that pure SB, unlike Classical Arabic, is rarely 
spoken.* Al-Toma mentions as “situations with the character 
of formality” lectures, speeches, formal addresses, and the 
like. In Bengali, what would be used in these types of situations 
would be formal or perhaps SB vocabulary with predominantly 
colloquial pronunciation and grammar. This almost exclusively 
written character of the SB should be borne in mind when think- 
ing of the universal uniformity of the SB throughout the Bengali- 
speaking world. In the second place, although it is possible in 
spoken Arabic to have a mixture of Classical and colloquial, in 
written Arabic Classical and colloquial are never mixed. As we 
shall see, the situation is quite different for written Bengali. 

It might be put this way: Classical and colloquial Arabic 
may be considered in some sense as two different languages, 
with clear and linguistically definable differences between them. 
The distinction between SB and CB is not as clear-cut. SB and 
CB might be better thought of as opposite poles of the same 
language. ° 

Pure SB is the recognized norm for the writing of the lan- 
guage, the peculiarities of which are respected and understood 
wherever Bengali is written or read. The CB, on the other 
hand, is first and foremost the standard spoken language of 
Calcutta. It is not merely another regional dialect.® It is by no 
means limited to Calcutta or to the area immediately surround- 
ing that city, but is used and understood throughout an area 
which crosses over many dialect boundaries. In relatively re- 
cent times there has been a movement within the Bengali lan- 
guage toward the fusion of the two forms. In modern Arabic, 
Iam told, only the most daring of dramatists allow their char- 
acters to speak in the colloquial form, even when the characters 
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might be of a category in which the speaking of Classical Arabic 
would be absurd from a realistic point of view. This situation 
prevailed in Bengali literature about half a century ago.’ It 
does no longer. Although there had been writers before him 
who had used the CB for serious literature, ® it can perhaps be 
fairly said that Rabindranath Tagore began a trend which con- 
tinues strongly in the present day. 

Tagore is fond of illustrating social and cultural situations 
with little fables. He tells such a fable to point out the increase 
of the use of CB in literature.? The story is that of a king who 
had two queens. One was sophisticated, immaculately groomed, 
and utterly regal. The other, of course, was simple, straight- 
forward, and charming in her unpretentiousness. I am sure 
that everyone has read enough fairy-tales to be able to imagine 
the course of the story. At first, the younger, un-queenly wife 
was scorned, but gradually she began to charm the king and to 
win his favor. And Rabindranath says, “The end of the story 
has not yet come; but there will be a day when the younger will 
sit upon the throne alone.” There might be some who would 
question the conclusion, but I am quite sure that the analogy is 
true enough. 


2. If one considers that the SB and the CB are at the two 
opposite poles of the same language, it is clear that descrip- 
tively there is an infinite number of points between them. Many 
of these points are occupied by individual writers whose work 
represents the state of the language at some phase in the history 
of the movement of the two poles toward each other. If one 
could graph the accepted norms of the SB and CB over the last 
century, I suspect the graph would show a truncated pyramid. 
Starting from the two extremes, the lines would show a gradual 
merging. The graph, however, is incomplete. The apex of the 
pyramid, the identity of the SB and CB, lies somewhere in the 
future. It is clear that over the last century the two extremes 
have been moving more and more toward the center. At the 
present day, they are still definably distinct, but are consider- 
ably closer together than they were a century ago. 

Tagore points out that in some ways the CB is replacing 
the SB, but that at the same time the CB is taking on some of 
the characteristics of the SB.'° One of the examples he uses. 
is this: some years ago, a man of the servant class using the 
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term / spekkha/!°* (“expectation of or waiting for someone or 


something”) would have been considered laughably absurd. This, 


Tagore says, is no longer so. Although / spekkha/ is a purely 
SB term, it is becoming entirely acceptable for all people. 

It is clear that if this study were to be a complete one, it 
would not only have to be descriptive of the present state of the 
two forms of the language, but it would have to trace the move- 
ment of the two forms toward juncture over a period of time. 
Account would also have to be taken of the fact that from the 
point of view of modern Bengali writers, the controversy be- 
tween SB and CB is still raging. Although in some speech and 
in most writing CB and SB have come close together, there are 
still ardent and powerful apologists for the extremes of both. 
Buddhadeva Bose, for example, a most influential writer and 
critic in modern Bengal, feels very strongly that modern crea- 
tive literature should be written entirely in the spoken language, 
with care taken to rid it of any influence of the SB. As we shall 
see, there are apologists for the other extreme, as well. 
Tagore represents the middle ground, which is probably also 
the realistic ground, that there is no need for concern over the 
blending of CB and SB. He feels that their blending is a natural 
linguistic development, and what once was a literary crime of 
major proportions, the mixture of CB with SB, can no longer be 
considered such. !4 

Descriptively, the differences between the SB and CB forms 
of the language can be summarized under four general catego- 
ries: lexical, phonological, morphological, and syntactic. 


2.1 Lexical distinctions. On the basis of a word list de- 
vised by Professor Gordon H. Fairbanks,” it was found that, 
of 511 items counted, 39.5% have correspondence (i.e., that 
the item is identical for CB and SB), and 5.4% can be used 
optionally in either CB or SB (i.e., for a given English gloss 
there are two Bengali meanings, one differing from CB to SB, 
the other corresponding). Therefore, 55.1% of the lexical 
items counted always have differing forms in CB and SB. It is, 
however, probable that these figures are descriptively less 
than accurate. When the forms were elicited, I had in minda 
definition of the breadth of the distinction between CB and SB 
rather than description of the actual current situation. In all 
likelihood, these figures represent the point at which CB and 
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SB are farthest apart, the point which might have been descrip- 
tively accurate 75 or 100 years ago. Since I was aiming at con- 
trasting forms, it is more than possible that my informant has 
given obscure SB forms, forms which are possible but used only 
rarely, in places where in present-day writing a CB form might 
more ordinarily be used. Thus the category of optional forms 
would probably be far greater than the present figures indicate; 
indeed, this would be reflected by Tagore’s statement that the 
actual lexical differences between SB and CB are diminishing. 

It might even be said that to a large extent in modern writing a 
choice between a CB and an SB form is stylistic rather than 
categorical, and would depend on whether the subject matter 
was heavy or light and on the author's choice of treatment of it. !* 


A few examples of CB-SB lexical differences are the fol- 
lowing: 


SB form CB form gloss 
hongso hag goose 
brikkho gat tree 
putro chele son 
naSika nak nose 
topto gorom hot 
dongon kora kamrano (to) bite 
agomon kora asa (to) come 
dan kora deoa (to) give 
bristi hooa jol hooal® (to) rain 
opekkha kora darano!® (to) wait for 


- It is to be noted that certain of the items above, such as the 
first and fourth, seem to show a relationship between SB and CB 
forms. About 18.5% of the total number of forms show such a 
relationship. The historical relationship between, for example, 
/hon8o/ and /ha8/ is that /has/ represents a normal develop- 
ment of the Sanskrit hamsa through the Prakrits, while the SB 
form /hon3o/ represents a re-borrowing by Bengali from the 
Sanskrit. This is by no means an unusual linguistic phenome- 
non; grammarians call the direct, popular descendant form 
tadbhava and the learned borrowing tatsama. 17 The large num- 
ber of CB-SB correspondences of this type in Bengali could, I 
think, be described in a very few statements. 
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2.2 Phonological distinctions. As the SB is not a spoken 
language, phonological differences between it and the CB are 
not significant. Even so, it is interesting to notice that there 
are people who will pronounce SB differently from their ordinary 
speech when reading an SB text aloud. The most frequent and 
noticeable phonetic differences are in terms of consonant clus- 
ters. For example: 


SB form CB form gloss 
loksmi lokkhi Laksmi 
smoron 83ron memory 


If a general statement were to be made, it might be said 
that the SB phonologically is closer to the orthography than is 
the CB. For example, as S. K. Chatterji indicates in the in- 
troduction to his Bengali Reader, what is sometimes called 
the “short a” vowel (phonetically), is, under certain cir- 
cumstances in the CB, phonetically [o]. In SB pronuncia- 
tion this vowel tends to remain[2]. There are many other 
cases of this kind. Orthographically there are three sibi- 
lants, dental, palatal and retroflex, which some people 
attempt to differentiate when reading SB. Phonemically 
in the CB there is only one sibilant. Further, there are sym- 
bols for both dental and retroflex nasals, which some people at- 
tempt to differentiate; phonemically there is only one dental. 
There are several symbols for the velar nasal. The -Cy- clus- 
ter is phonemically -CC-, etc. This is an area of the study 
which to my knowledge has not yet been adequately described. 





2.3 Morphological distinctions. Although lexical items 
represent the major difference between the SB and the CB, sig- 
nificant differences occur on the morphological level. These 
morphological differences are primarily in verb forms and in 
certain noun endings. Both of these types of morphological dif- 
ferences can be demonstrated by a translation of the simple ex- 
pression “He has given (it) to us”; 


SB /amadigoke dan koriyache/ : CB /amader diyeche/ 


These differences are firm and consistent throughout the lan- 
guage. ASB verb form is never used in a CB environment, 
and a SB substantive suffix is rarely if ever used in a CB 


























49 


environment. The distinctions can be summarized simply: 


a. Verb forms (Synopsis of lst person forms for all tenses): 





Tense SB form CB form 
Simple present éoli foli 
Simple past €slilam éollam 
Conditional éslitam éoltam 
Simple future Eslibo Eolbo 
Pres. continuative Edlitechi Eolchi 
Past continuative €éslitechilam éolchilam 
Present perfect Esliyachi Eolechi 
Past perfect €sliyachilam Eolechilam 
Infinitive €olite Eolte 
Conjunctive Eslivya Eole 


b. Noun endings. The most striking difference is the form 
of the animate plural suffix -ra when followed by a case ending: 
in SB it is -dig(>)- and in CB it is -de-. Thus, using the first 
person plural pronoun as an example: 





Case SB form CB form 
Nominative amra amra 

Genitive amadiger amader 

Objective amadigoke amader, amadike, 


or amaderke 


There are differences, also, in the definitives and pluralizing 
suffixes of nouns. 

There are other morphological distinctions. Most distinc- 
tions, however, have either disappeared or are disappearing in 
the blending of SB and CB with each other. For example, people 
writing in SB today pay little if any attention to such considera- 
tions as gender, as the pandits once did. This is also true of 
compound words. SB was once full of compounds, but it is now 
usual to eliminate sandhi, break down compounds into their 
component parts, and to treat these component parts as sepa- 
rate entities."* It is also generally true that such prefixes as 
/su-/, /ku-/, /9-/, which indicate quality or negativeness, are 
rarely found in CB constructions, being considered character- 
istic of SB. 
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2.4 Syntactic distinctions. Since to my knowledge not much 
work has yet been done on Bengali syntax, it is a littie difficult 
to make a systematic statement about it here. A few generali- 
zations, however, might be of some value. 

It is possible to consider such verbal compounds as /dan 
kora/ and />pekkha kora/ as syntactic phenomena. Such com- 
pounds occur in far greater number in SB than in CB. Of the 
142 verbs in the list mentioned above, 57 forms were compound 
in both SB and CB; and 13 were simple in both SB and CB. In 
no instance did CB show a compound verb where the SB verb 
was simple. 

A second generalization might be that word order in SB con- 
structions is more rigid than it is in CB.!% Almost invariably 
in SB constructions, the verb form is final in the sentence. In 
CB, however, shades of meaning and emphasis can be trans- 
mitted by varying the word order. These shades of meaning 
and emphasis are lexical functions inSB. Compare the two 
versions of the sentence “ The two brothers loved each other 
very much”: SB /ubhoy bhratar porom(5) sompriti chilo/ : 

CB /khub bhab chilo du bhaier/. According to Sen,™ the initial 
and final expressions in the CB are the emphatic ones. In the 
case above, the SB compensates for this by the use of the strong 
terms /porom(2)/ and /sompriti/. 


3. The Bengali vernacular language, both from the point 
of view of prestige and the point of view of literature, was ina 
low state at the beginning of the nineteenth century. Bengali 
pandits much preferred to write in Sanskrit; when, as at Fort 
William College, they had to write Bengali, they rationalized it 
by saying that at least Bengali was “the best of the vernaculars, 
because of the preponderance of Sanskrit in it.” It can be 
gathered from this, I think, that the Bengali which these pandits 
wrote was not the Bengali which was the spoken language of the 
common people. 

Strangely enough, the only people who were paying much 
attention to the vernacular at that time were the missionaries. 
It was the custom not many years ago to give the missionaries 
perhaps more credit than they deserve for the revival of ver- 
nacular Bengali and for the part which they played in raising 
the vernacular to a position of dignity. In the same way, it is 
the fashion now to debunk the idea that the missionary role 
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in the revival was significant. I think that it cannot really be 
questioned that their role was a significant one, but I think it is 
also true that this significance was not necessarily the result of 
a profound concern with the beauties of the language. There 
were certainly among the missionaries men who had a genuine 
love for the language and its literature.** Even had this not 
been so, however, the nature of missionary activity would have 
necessitated becoming involved in language and literature. 
These men were in daily contact with the people of Bengal; they 
had to learn the language as it was spoken, in order to carry out 
their work. Even if only from the point of view of text- books 
and propaganda, and again, in many cases it was far more than 
this, they had to learn to speak and write the language popular- 
ly, clearly, and usefully. The chief end of the missionaries in 
their use of the language was, as De says, “not to display their 
erudition”, but to make themselves intelligible. These direct 
and utilitarian ends, together with the “tendency of the mission- 
aries toward realism, directed the whole movement (of the 
Bengali revival) toward simplicity and naturalness.”?? 

From this point of view, one of the major landmarks in 
this revival of the vernacular was the opening of Fort William 
College in Calcutta in Calcutta on May 4, 1800. The purpose of 
the College, according to Lord Wellesley’s Minute, was “the 
better instruction of the Junior Civil Servants of the Company.”™ 
Actually, the College succeeded in a purpose far more signifi- 
cant in the long run than this. It contributed substantially to 
Bengali literature through the writings of its faculty and students, 
and through its emphasis on the vernacular as a language as 
worthy of study as the classical languages. The writings of the 
College faculty were published mainly at the Baptist Mission 
Press at Srirampur, which printed not only the original works 
of many Bengali scholars and writers, but also for the first 
time many of the classics of Bengali literature as well, includ- 
ing the Ramayana of Krttivasa and the Annadamangal of Bharat- 
candra. * 

On the faculty of Fort William College, perhaps the most 
important single individual was William Carey, an English 
missionary who had gained an appointment as instructor in 
Bengali. Carey, together with other instructors at the insti- 
tution, wrote large numbers of books in that language. These 
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books are, as De puts it, “far from being invulnerable from 
point of view of literary merit”. Yet, despite mistakes in idiom 
and expression — perhaps inevitable for people writing a for- 
eign language, no matter how familiar — the books which these 
men produced were for the most part simple, direct, and writ- 
ten in the language of the people. 

Perhaps it would be wise to pass over Carey’s Dialogues, 
the literary value of which seems to be questionable, and 
turn for illustration to his Itihasa-mala, the literary value of 
which, I think, is not. The Itihasa-mala is a collection of some 
150 stories taken from the Paficatantra, the Hitopadesa, and 
other Sanskrit sources, translated into simple Bengali prose. 

A few lines will indicate the style of the work. ?’ 


ekajana ghataka brahmana arthat bibaher yojaka eka 
eee madhya diya Asitechila | se sthane eka byaghra 

hataka brahmanke miarite udyata haile | brahmana 
bhita haiya krandana karite lagila | 











Here, except for the oral folk literature, are the first 
traces of literature in a style other than the ornate one which 
had become the norm among the pandits. The language of the 
Itihasa-mala can hardly be called modern colloquial Bengali. 
But I think the breadth of the gulf between it and the SB of the 
pandits will become clear when the above passage is compared 
with this one sentence from Mrtanjaya’s Prabodhcandrika:”° 








akaradi ksakarantaksaramala yadyapi paficasat 
samkhyaka kimba ekapaficasat kimba saptapaficadsat 
samkhya _parimita hauk tathapi etabanmatra katipaya 
varnavalivinyasvise gavasatah vaidik laukik samskrt 
prakrt paisacadi astadasa bhasa o nanadesiya 
manusya jatiya, bhAsavisesavasatah aneka prakara 
bhasavaicitra sastrato lokatah prasiddha ache 


The difference is such that I think that even one entirely unfa- 
miliar with Bengali will be able to see it. It is true that the 


subject matter of the two passages is quite different. But the 
difference is deeper than that. As De says, ”? 


“the genius of Sanskrit is not the same as the genius 
of Bengali; it would be a mistake to suppose that 
Sanskrit syntax should rule syntax in Bengali. ... 
The use of long drawn-out compound words, the 
occurrence of unusual phrases, and extensive bor- 
rowings from Sanskrit make it difficult for the 
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unitiated to comprehend the sentences [of this SB] at 
a glance. 


The last observation, to my mind, is rather an understate- 
ment of the case; but, in general, De’s statement is an apt de- 
scription of Mrtanjaya’s writing, which in turn is a good exam- 
ple of the SB of the pandits. It was not that Mrtanjaya and the 
other pandits could not write, and write well, in a simpler 
fashion. Quite to the contrary.” They simply preferred the 
more ornate and complex SB constructions. 

Why the pandits should have felt this way is a matter of 
speculation. Some reasons seem obvious. These were learned 
men, and Sanskrit the learned language; therefore, the closer 
to Sanskrit they could get, the more comfortable they felt. As 
is well known, the study of Sanskrit is for the pandits not a 
means to an end, but the end itself.™ It is also true that these 
groups of learned men had but little contact with the masses, 
and had, therefore, little knowledge of and contempt for the 
means of the masses of expressing themselves. Finally, there 
might have been among the pandits, as there is today among 
certain people, a trace of the view of language held by the 
Pirva-mimamsa school of philosophy. According to this sys- 
tem of thought, the relationship between the word and its mean- 
ing is natural, and not created by convention. Language itself 
is an aspect of the Eternal Truth, and must be treated accord- 
ingly. * A corollary of this is the view that for serious sub- 
jects, a language of “spiritual adequacy” is necessary. This 
is an idea, not dissimilar to some modern American literary 
criticism, which is still current in Bengal. This view of the 
necessity of a “spiritually adequate” language, together with 
the idea that the subject matter of literature must be of a cer- 
tain standard of moral quality, that literary themes must be 
edifying to be of value, results in the use of Sanskrit or of 
highly Sanskritized Bengali for any serious or creative form of 
literature. There is another view (not my own) of why the pan- 
dits reacted so strongly against the CB. The CB of the time in- 
cluded a great many Persian and Arabic words. The pandits, 
as Hindus, went to the other extreme and incorporated as much 
Sanskrit as possible into their Bengali texts. This is a view 
held by some scholars in East Pakistan. I am further told 
that the fact that there were many Perso-Arabic words in the 
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CB of the time is evidenced by the work of two of the earliest 
writers of the CB, Tekcand and Vidyasagar. The latter, usual- 
ly thought of as a Sanskrit scholar, wrote under a pseudonym 
pamphlets in the CB directed against the critics of his child 
marriage and widow- remarriage ideas. 

As can be seen from the case of the missionaries and of 
Fort William College, from at least one point of view the con- 
flict between the two forms of Bengali began as a result of the 
cultural contact of Europe and Bengal. There was no conflict 
until the vernacular had been raised in status by missionary 
activity, by original writing, and by making available in printed 
form many of the standard and classic works of the Bengali 
language. 

The contact of cultures produced another situation which 
had far-reaching effect on the language and literature of Bengal. 
Except for the missionaries and military, the European ele- 
ment in Bengal was primarily an urban-centered commercial 
element, manifested in the form of the Company. With the 
growth of the city of Calcutta as the focal-point of this commer- 
cial activity and its by-product of employment for Bengalis, the 
complex of Bengali society was substantially changed. And with 
the societal structure changed, of course, its language and lit- 
erature. Not only were there new needs for the language — 
administration, commerce, and the like, *? which necessarily 
affected its nature, but the new urban environment seems to have 
been particularly nourishing to vernacular literature. Calcutta 
grew rapidly not only as a commercial center but as an educa- 
tional center as well. Cultural activity began to come to the 
fore. As a result of these things and others, the creation of a 
written literature became for the first time a professional 
activity, the function of an urban cultural stratum. A literary 
class, a class of professional writers, was produced. This 
was the first result. 

The second was that these writers were interested in 
a kind of literature new to Bengal. They soon found that the 
traditional literary Bengali, the SB of the pandits, was not 
adequate for their new literary ideas. These new literary 
ideas were primarily prose style and the novel form. 

It is well known that the first literature of any language is 
poetic literature, if for no other reason than that poetry is 
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easier to memorize and recite than prose would be. Although 
there were prose writings in Bengali before the coming of the 
Europeans, * prose was a form perhaps more fully developed 
in English. Prose was also a form which appealed greatly to 
the Bengali creative imagination. Thus, English prose became 
the model which the Bengali writers followed, in both style and 
subject matter. One of the first novels written in Bengali was 
a social novel. * The very concept of a social novel repre- 
sented a break with traditional Indian literary ideals. This, to- 
gether with the relative realism implied by such a literary form, 
led to a reconsideration of the value of the SB as a means of 
expression, 

Furthermore, the educational and other institutions which 
were established by the British in Calcutta were naturally de- 
signed on familiar British lines. Instruction in such institu- 
tions was naturally from a British point of view. This gave 
rise to a large group of middle and upper class educated Ben- 
gali people who were oriented primarily toward the culture and 
literature of Europe, and particularly of England. The literary 
values of the consumers as well as those of the creators had 
been changed. How profound this change of values was can be 
easily seen by the most superficial examination of the Bengali 
literature of the mid-nineteenth century. The most striking 
case in point, I suppose, is that of Michael Madhusudhana Dat- 
ta, a colorful and in many ways immensely tragic man living 
between two worlds. *® 

As Chowdhury points out in his paper in this volume, these 
changes had their effect in language as well as literature. Both 
SB and CB as they are known today are products of urbaniza- 
tion. Although a form of SB existed before the growth of Cal- 
cutta as an extremely Sanskritized Bengali, there was no such 
thing as CB before the city. The Bengali language then con- 
sisted of the SB and the regional dialects. The CB is the lan- 
guage of the educated middle and upper classes of Calcutta. 
Probably, as Chowdhury indicates, it is based upon the growth 
of the city. For the reasons noted above, SB tended to become 
less Sanskritized as a result of the cultural change. It moved 
away from the exclusive possession of the pandits and came 
into the possession of the new writer class, with their new 
literary aims. Looking at it in this way, which is perhaps 
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attempting to analyze it more than the situation warrants, since 
clear lines cannot always be drawn among the various divisions, 
it might be said that there have been five distinct forms of the 
language: 


(1) The regional dialects, which of course, remain 
essentially unchanged throughout these various devel- 
opments. 37 


(2) The CB as it was a century ago, a strictly spoken lan- 
guage. 


(3) The CB as it is today, remaining primarily a spoken 
language, but being used increasingly for literature. 
As Tagore has pointed out, the CB is gradually taking 
on more and more SB elements. 


(4) The old form of the SB, the panditi-bhasa, of which 
the Prabodh-candrika of Mrtanjaya (cited above) is an 
outstanding example. 








(5) The modern-day SB which, influenced by cultural 
change and by the increasing use of the CB in litera- 
ture, is far less definable grammatically than it was 
a century ago. 


That there is a striking difference between the panditi- 
bhasa and the SB of a not much later date can be easily shown. 
Compare for example the passage from the Prabodh-candrika 
above, written in 1813, to these lines from Bankim-candra 
Cattopadhyay’s Rajsimha (1878):*® 





eka pracina katakaguli citra becite Asiya tahader hate 
pariyachila | hastidantanirmita phalke likhita ksudra 
ksudra apiirba citraguli | pracina bikrayabhilase eka 
ekakhani citra bastrabarana madhya haite bahira 
karitechila | 


Bankim-candra’s novel was written after the new concepts of 
prose and the novel had taken hold. But compare this in turn 
with a paragraph from a modern writer:”? 


amalake 4j dekhte Aasbe | patrer nam arun| nam sunei 
amalar buktite yena arun abha charie gela | kalpanay 
se kata chabi na akale | sundara susri yuba — balistha 
mathay teri gaye pafijabi — sundar supurus | 
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Even on the most superficial level it can be seen that the move- 
ment is toward simplicity and directness. The quotation from 
Bonaphul is almost pure CB. 

Buddhadeva Bose says that after Bankimcandra prose “con- 
tinued to be written in a manner artificially removed from the 
spoken language”.*® Although this is a true statement when the 
writings of Bankim-candra and his followers are viewed in the 
light of modern Bengali prose with its abundance of CB, it is 
also true that Bankim-candra represents the beginnings of the 
movement away from the unreadable stiffness of the panditi- 
bhasa. He was quite conscious of the artificiality of that form 
of the language. *! 

Although the main stream of Bengali literature in the nine- 
teenth century continued to be that of the SB (although no longer 
panditi-bhasa), there were islands in the stream. A contempo- 
rary of Bankim-candra, Kaliprasanna Sinha, wrote in the CB, 
and wrote, according to Bose, “brilliantly”. But whether it 
was due to his premature death or whether it was due to the 
fact that the literary world was not ready at that time for too 
radical a departure from tradition, the writing of Kaliprasanna 
Sinha remained merely an island in the SB stream. 42 It was, 
in fact, not until Pramatha Chowdhury, in the early twentieth 
century, that the assault of the CB on the citadels of literature 
began in earnest. In his journal Sabuj-patra Pramatha Chowd- 
hury wrote scathing and lyrically bitter critical articles against 
the accepted norm of the SB, upholding the CB as a form of the 
language worthy and suited for adoption by the literary people 
of Bengal. A few rough translations from his articles will il- 
lustrate well enough the trend and expression of his thinking:*? 








“We are the plaintiffs in this [literary] court ... we 
are only trying to reclaim our lost inheritance. Our 
opponents |i. e., the defender of the SB] know that 

possession [of literary control] is nine points of the 


law. ... They believe that the demand of our mother- 
tongue is but temporary, and that it will fade away, in 
time ...” 


“Once a custom is established, it remains through 
sheer inertia. This is the only explanation there can 
be for why things are as they are. In most people’s 
minds, the fact that something is current is proof 
enough that it is necessary that it remain current. ... 





There are those who count their language as an in- 
ferior thing, and feel that writing literature in it is 
merely “the noisemaking of vulgar people ...” 





“If there is no introduction of the human element into 
literature, it is not literature. ... Artificiality has 
no place in art...” 


“In Bengal, little boys (and others) think that if you 
put an anusvara on a Bengali word, you have Sanskrit; 
or that if you cut the anusvara or visarga off a San- 
skrit word, you have Bengali. If you cut the tail off a 
monkey, will he become a man? ...” 





The voice of Pramatha Chowdhury was a strong and an ef- 
fective one. But it was with Rabindranath that the CB found its 
place in literature. It is well beyond the scope of this paper to 
go into the development of Rabindranath’s style. That has been 
covered well elsewhere. ** Let it be said only that from his 
earliest days, Rabindranath had a deep feeling for the colloquial 
language. The letters which he wrote from Europe at the age of 
18, published under the title Yurop prabasir patra, are in pure 
CB** — as Bose says, “lively and lovely prose, the prose of 
everyday speech”. In terms of his more formal work, it was 
not until his novel Ghore baire that his CB style came to full 
realization. The novel was written in 1901-2. I think it is fair 
to say that from that time, the CB began to usurp the place 
which had once been occupied by the SB alone. 

“The end of the story has not yet come; but there will be a 
time when the younger will sit upon the throne alone.” 








Notes 


1. Leonard Bloomfield, Language 48-53 (New York, 1933). 


2. See for example Preserved Smith, The age of the ref- 
ormation 451 ff. (New York, 1920); E. de Moreau etal., 
Histoire de l’eglise Vol. XVI, 94ff. 


3. S.J. Al-Toma, The teaching of Classical Arabic to 
speakers of the colloquial in Iraq; a study in linguistic duality. 
octoral thesis for the Graduate School of Education, Harvard 
University, 1956; unpublished. 








4. The father of a friend of mine recalls instances of 
pandits and scholars of his generation speaking in SB in ordi- 
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I am told that in an article by Bankim-candra on the contribu- 
tion of Tekcand’s prose to Bengali literature, instances of such 
SB speech are recorded and are held up to ridicule as preten- 
tious and laughably absurd. I have not had access to the article 
in question. 


5. It is not unlikely that, if one were to follow out these 
parallels, it would be found that a closer comparison to the 
Classical - colloquial Arabic distinction in some respects 
would be that between Sanskrit and Bengali. 


6. The regional dialects represent, of course, the third 
major division of the language. The regional dialects are very 
highly divergent in some cases — as between, for example, 
Birvhum in western West Bengal and Chittagong in what is now 
East Pakistan. Although the question of these regional dialects 
does not relate directly to the present problem, it is interesting 
to note that the dialects too have had their influence on the form 
of the standard spoken language, and thus, indirectly, on the 
literary one. For example, the form /sathe/, “with, accom- 
panying”, although originally an East Bengal dialect form, is 
now commonly used in the CB throughout West Bengal. It is 
now at least as common as the once more usual form /songe/. 


7. See Buddhadeva Bose, An acre of green grass 20 (Cal- 
cutta, 1948). This was true in Bengali despite the fact that 
even in classical Sanskrit drama the characters spoke in Prak- 
rit when appropriate. See A. B. Keith, A history of Sanskrit 
literature 30-1 (Oxford University, 1920). 


8. Among the writers of CB who preceded Rabindranath 
are Kaliprasanna Sinha and Tekcand. See Sukumar Sen, 
Bangala sadhitye gadya 94 ff., 86 ff., (Calcutta, 1935). Accord- 
ing to M. Chowdhury, however, the work of these men might 
be better described as “including colloquialisms” than as being 
“written in the colloquial language”. For a brief discussion of 
Rabindranath’s contributions, see p. 58 below. 


9. B&angala-bhasa-paricaya 39 ff. (ViSvabhdrati, 1938). 
10. Ibid. 


10a. In this paper phonemic transcription is enclosed in 
slant lines; other Bengali material in roman letters is in trans- 
literation. 














ll, The reason for this change, Tagore says, is not only 
that class distinctions are becoming less and less clear-cut, 
but also that with the increase in education and general aware- 
ness, the subject matter of every-day discussion is changing. 
This means that terminology which was once limited to the SB, 
since it was considered that only SB terminology could be used 
for serious literature and discussion, is now entering the CB. 
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This concept, that serious or abstract thought cannot be ex- 
pressed in the CB is, as we shall see, not uncommon in Bengal. 
Buddhadeva Bose would approach the matter from the other 
direction, and say that common and homely ideas and images 
cannot be expressed in the SB. 


12. The somewhat sarcastic name for this mixture of CB 
and SB is guru-candali. The term indicates people high and 
low on the social scale respectively. 


13. The list was devised in Poona in the summer of 1955 
for the purpose of comparative study of the Indo-Aryan lan- 
guages, and not for the purpose for which I have used it here. 


14. See Sukumar Sen on Bankim-candra in Bangala sahitye 
gadya, 107 ff. 


15. The data shows that the form /bristihsoa/ is also 
common in the CB. 





16. /opekkha kora/ is also a common form in CB. These 
are examples of the optional category mentioned above. As 
before stated, the movement toward the center is from both 
directions. A great many SB forms may be used in the spoken 
CB, and, depending on the circumstances, almost any CB form 
can be used in writing the language. 


17. Cf. Bloomfield, Language 492 ff; S. K. Chatterji, 
Bangala bhasatattver bhumika 50 ff., (Calcutta, 1950). 


_ 18, For a treatment of compounds in Bengali, see Sukumar 
Sen, Bangala bhasar byadkaran 140 ff. (Calcutta, 19 ). 


19. Ibid. 
20. Ibid. 


21. From Mrtanjaya’s Prabodh-candrika, quoted in S. K. 
De, History of Bengali literature in the 19th century 278 (Cal- 
cutta, 1919). The passage reads: 

















anyanya desiya bhasa haite gaurade siya bhasa uttama 
sarbottama samskrtabhasa bahulya hetuka 


22. De (op. cit. 127) says of William Carey that he “found 
the appointment (as instructor in Bengali at the College) bring- 
ing in its train responsible duties ... and affording him an op- 
portunity to cultivate ‘the beautiful language of Bengal’”. The 
source of the quotation is not given. 


23. De, op. cit. 289. 
24. Minutes in Council at the Fort William, by His Excel- 








lency the Most Honourable Maquis of Wellesley, dated August 
18, 1800. Quoted in De, op. cit. 120. 
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25. The press at Srirampur printed works not only in Ben- 
gali, but in many other languages. In the appendix to the 10th 
Memoir (1832), it is shown that two hundred and twelve thou- 
sand volumes in forty different languages had been issued be- 
tween 1801 and 1832. See De, op. cit. 157, note 1, and J. C. 
Ghosh, Bengali literature 100 ff. (Oxford University, 1948). 


26. The Dialogues are called in Bengali Kathopakatha. The 
literary merit of the work is in question seemingly because it 
was not written with a literary end in view. As a matter of 
fact, Carey himself did not write most of it, but simply “em- 
ployed some sensible natives to compose dialogues upon sub- 
jects of a domestic nature, and to give them in precisely the 
natural style of the persons supposed to be speakers”. (From 
Carey’s introduction to the work; see D. C. Sen, History of 
Bengali language and literature 725 (2nd ed. Calcutta, 1954). 
Thus it is considered by some merely as an interesting socio- 
logical commentary. 











27. The full text of the story is given in De, History 148. 
Translation might read: 


Once there was a Brahmin match-maker (ghataka), a 
marriage-agent, who was coming through a forest. 
In that forest there lived a tiger, who was ready to 
eat this ghataka Brahmin. The Brahmin was very 
frightened, and began to weep. 


28. Quoted by De, History 217. 


Even if there were fifty or fifty-one or fifty-seven 
letters of the alphabet, there could only be so many 
different arrangements of the letters. There are 
eighteen forms of language — Vedic, ordinary, San- 
skrit, Prakrit, Paisaci (i. e., demonic speech), etc. 
Among men, the characteristics of speech vary ac- 
cording to their several countries and castes. All 
the various kinds of language are well known from 
the sastras. 


29. History 220. 


30. Compare, for example, this passage from the same 
work, the Prabodh-candrika quoted in De, History 221): 





batir nikate giya Apana strike dakila... tahar stri 
gatikriya kahila ogo ami yaite paribo na 4mar hat 
yora ache 

Going up to his house, he called his wife ... His 
wife slowly answered — I cannot go. My hands are 
joined in supplication. 


Even superficially it can be seen how much simpler the 
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language is. The most striking characteristic of this passage 
is its lack of compounds. 


31. See M. B. Emeneau, India and linguistics, JAOS 75. 
145-6 (1955). 


32. See S. Radhakrishnan, Indian Philosophy Vol. II, 389 
ff. (2nd ed. London, 1951). 


33. A parallel to this in some ways is the origin of Hindu- 
stani, which was also developed for military and commercial 
purposes. In the case of Hindustani, it should be noted, the 
impact was of a non-European Muslim culture. 











34. See Sen, Bangala sahitye gadya, 1-10, and De, op. cit., 
Appendix I, 455-86. According to many scholars, the earliest 
specimen of Bengali prose is the Siinya-purana of Ramai Pandit 
(published by the Bangiya-sahitya-parisad, Calcutta, 1912). 

S. K. De, Haraprasad Sastri, and others, find in this work 
“traces of the original writings of Ramai Pandit, going back to 
at least the 13th century A. D.”. The question of the date of the 
work, and in fact the question of whether or not it is really 
prose, are still open. There is hardly space to deal with them 
here. 


35. Alaler gharer dulal of Tekcand Thakur (whose real 
name was Pyaricand Mitra), written in 1858. 











36. Discussions of this man and his life and works can be 
found in any literary history of Bengal in English, and in any 
of several biographies in Bengali. In English, see for example 
J. C. Ghosh, Bengali literature 137-48. 





37. See S. K. Chatterji, Origin and Development of the 
Bengali Language, Calcutta University 1926, Vol. I, 138-146. 


38. In the Basumati edition of Bankim-candra’s works 
(Bankimcandrer granthabali), Vol. 1, 3. Translation: 


An old woman had come to sell some pictures, and had 
fallen into their hands. The pictures were exquisite 
miniatures painted on ivory tablets. The old woman, 
eager for a sale, brought out a picture from underneath 
her garments. 


39. From Bonaphul’s Srestha galpa (Calcutta, 1957), the 
story called Amala. Translation: 














Today he will come to see Amala. The bridegroom’s 
name is Arun. At the mere sound of his name, it was 
if the glow of dawn (arun) was spreading in Amali’s 
breast. How many beautiful pictures she painted in 
her imagination of him — handsome, gracious, young, 
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strong, his hair dressed, a pafijabi on his body —a 
beautiful man. 


40. Bose, An acre of green grass 19. 





41. See Sen, Bangala sahityer katha, 173-4. 
42. Bose (op. cit. 19) describes it in this way: 


Kaliprasanna Sinha ... in his Nutam paficar nakga, 
first brought literary grace to the spoken tongue. If 
he had lived reasonably long, the dispute between the 
two styles might have been settled long ago. But he 
died unpardonably young, and Bahkim-candra was 
the only author worthy of imitation. 








43. From the essay entitled Sadhubhasga banama calit 
bhaga in the collection Prabandha samgraha Vol. I, 318-27 
(Vigvabharati, 1952). The feeling about the closeness of the 
relationship between Sanskrit and Bengali mentioned in the last 
passage is still widely held in Bengal. Actually, lexical con- 
siderations aside, Bengali is probably one of the Indo-Aryan 
languages farthest removed from Sanskrit. See Chatterji, 
Origin and Development of the Bengali language. 














44. Sen, Bangala sahitye gadya, 157-205. 





45. Published under the title Yurop prabasir patra; see 
Sen, Bangala sahitye gadya 192 ff. 
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Present trends 


0. British India was partitioned in 1947 and two new inde- 
pendent states came into being. One of them is the Islamic 
Republic of Pakistan, whose two zones lie geographically more 
than a thousand miles apart. The eastern half of it, which has 
a little over half the total population of Pakistan, is known as 
East Pakistan. It also happens to be the eastern half of the for- 
mer Province of Bengal whose other half now belongs to the 
Republic of Bharat. We thus today have Bengali spoken in two 
immediately contiguous lands belonging to two independent 
states with many conflicting interests. There is, of course, a 
significant distinction between the Bengalis of the two republics: 
East Bengal Pakistanis are overwhelmingly Muslims, whereas 
West Bengal Indians are predominantly Hindus. On the other 
hand, the language of East Pakistan is not understood by the 
people of West Pakistan. The latter, although belonging to 
various linguistic groups, have shown a general inclination to 
accept Urdu as the language of culture education and adminis- 
tration. 


This paper is primarily about Bengali as it is spoken today 
in East Pakistan. The total area of East Pakistan is about one- 
sixth that of the whole of Pakistan, but it has a large variety 
of dialects, some of them even mutually unintelligible. In the 
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very near future a new standard colloquial language with a spe- 
cial prestige value may pattern out a blend of these dialects. 

If and when the bond with West Pakistan with its different set of 
Indo-Aryan languages becomes more intimate, coupled with a 
greater zeal for Islam, the Bengali of East Pakistan may devel- 
op its potentialities in a direction quite different from that of 
West Bengal. There is also the spirit of nationalism in the 
country which devoutly wishes to replace English by Bengali in 
as many spheres as possible. The intention of this paper is to 
explain in some detail the various facets of these issues by in- 
quiring into the major features of these dialects and their rela- 
tionship with other social, religious, political, and cultural 
values. 

Any field work for collection of new data concerning the 
dialects of East Pakistan was beyond the scope of the present 
study. The broad facts of the local dialects were culled chief- 
ly from the Linguistic Survey of India,'! which offers copious 
texts from different dialects of Bengal. General background in- 
formation and material on the current situation come from the 
author’s personal experience. His special advantages for this 
may be summarized: (1) native speaker of a highly divergent 
local dialect, Noakhali, (2) long residence in Dacca, the capi- 
tal of East Pakistan, (3) extensive travel throughout Bengali 
speaking territory as a result of his father’s being a civil ser- 
vant, and (4) active participation in the life of the University, 
where the language problem is perhaps felt most acutely. An- 
other important aid in the writing of this description of the 
Bengali of East Pakistan was the unpublished doctoral thesis of 
C. A. Ferguson on the grammar of Bengali.” 


1. An educated Bengali in East Bengal (Pakistan) as well 
as in West Bengal (India) is fully aware of three distinctly dif- 
ferent kinds or styles of Bengali, the use of which in various 
socio-cultural spheres is rigidly restricted by well-defined 
traditions. One does not use one’s local dialect (unless it hap- 
pens to be one of or around Calcutta) in the office, University 
or in any discussion tinged with formality. It is used only with- 
in the home or informally outside. The second is the Standard 
Colloquial Bengali (SCB) which has developed out of the local 
dialects spoken in and around Calcutta, the former capital and 
industrial and cultural center of unpartitioned Bengal. This is 
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a standardized form which, like the dialect of Tuscany for 
Italian, ? was accepted by all Bengalis as the elegant mode of 
speech of all Bengalis in all formal circumstances outside the 
home. Another significant feature of this colloquial form is 
that it can also be used in writing. The third kind is Standard 
Literary Bengali or High Bengali (HB), which is only a literary 
form and is never used colloquially. One might describe the 
state diagrammatically as: 











1 2 3 
Local Dialect High Bengali 
Spoken only Only written 
at home Stand. Coll. Bengali 








both written and spoken 























The stratification is also graded in terms of prestige value. 

The local dialect, when widely removed from the patterns of 
the SCB such as any East Bengal dialect may be, is looked d 
down upon as rude and vulgar. It has been used by popular 
writers of fiction as a means to evoke laughter. The rural pop- 
ulation, who form ninety percent of the total and are mostly ag- 
riculturists, know almost nothing of the correct forms of SCB. 
They use the local dialect in all circumstances. SCB is the 
form one must adhere to in polite company; it is the form the 
radio and the cinema communicate through; it has prestige even 
as a written form. High Bengali (HB) rates even higher. If the 
topic is grave, warranting a scholarly discussion, even the SCB 
is rejected in favor of the HB. Only in recent years has there 
been a shift in emphasis, with SCB usurping a major share of 
the glory that formerly was exclusively bestowed on HB. 

The relationship between HB and SCB is the subject of a 
paper by E. A. Dimock in this volume;* emphasis will be given 
here to the characteristics of the local dialects of East Paki- 
stan and the consideration of certain general problems and pres- 
ent trends. 


2. Accepting the possibilities of other kinds of variation 
and other kinds of interrelationships among the variants, we 
can for the purposes of our discussion tentatively follow Grier- 
son's classification’ of the dialects of East Bengal (coinciding 
approximately with present-day East Pakistan) into four main 
groups. 
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1. Northern Bengali. The dialects of Dinajpur, Rajshahi, 


Bogra, Pabna. 


nN 


Rajbangshi. The dialects of Rangpur. 


Ww 


Eastern Bengali. 6 The dialects of (a) Dacca, Mymensing, 
Tippera, Backer- 
gunge, Sylhet, 
Haijong. 


(b) Faridpur, Jessore, 
Khulna. 


4. Southeastern Bengali. Chittagong, Noakhali, Chakma. 


In the absence of any systematic study regarding mutual intel- 
ligibility among these dialect groups it is difficult to state pre- 
cisely how they stand in relation to each other.’ We can only 
generally describe it as follows: (1) dialects of the fourth group 
are progressively less understood as we move upwards from it 
to the first group (where they become almost unintelligible); (2) 
the dialects of the first three groups may be understood by all 
people all over East Pakistan including speakers of the fourth 
group; (3) Sylhet of the third group, insofar as it shares closely 
the characteristics of the fourth group, has also approximately 
the same area of mutual intelligibility. Haijong of the third 
group and Chakma of the fourth group are spoken by two ex- 
tremely small groups of people and are not easily understood 
by even the speakers of the nearest dialects belonging to the 
same group. The latter has a writing system all its own, wide- 
ly different from Bengali orthography. 

Let us now examine some of the significant features that 
differentiate these dialects from one another. 

In phonology, as shown below in 3., there is very little de- 
viation from the inventory of phonemes of SCB, a fair amount 
of deviation in the phonetic actualization of certain phonemes 
in Eastern and Southeastern Bengali as opposed to all other 
dialects, and considerable difference in the distribution and 
frequency of certain phonemes. On the whole, however, apart 
from a few striking features of pronunciation there is relatively 
little variation in phonology. 
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In morphology the picture is different. It is true that the 
basic structure of the word is about the same in all dialects, 
but the actual form of many derivational and inflectional suffixes 
varies enormously from one dialect to another, and this is prob- 
ably to be regarded as the most important area of divergence 
either between these dialects and SCB or among the dialects 
themselves. The facts are summarized in 4. 

The dialects of East Bengal do not vary very much in syn- 
tactical patterns except in certain negative and interrogative 
construction. In Chittagong-Noakhali dialects the negative par- 
ticle na may be used before the verbal form: /no zaiyum/ ‘I, 
we will not go’ as opposed to SCB use after the verb /Jabona/. 
Example of the interrogative ki * what’ : nam ki tsr/ ‘What is 
your name?” as opposed to SCB /tomar nam ki/. 

It has not been possible to make a systematic investigation 
of the lexical differences among the dialects, but it is the au- 
thor’s feeling that this will not prove as important as morpho- 
logical variation. 

The phonemes of SCB may be listed as follows: ® 


k kh g gh py i u T a 
t th j jh e ° fad & 
t th d dh ae % z& % 
t th d dh on a a 

p ph b bh m 


eageied eS 


Some problems in the analysis cf Bengali phonology are: (a) 
whether in addition to an alveolar or palatal /3/ we should also 
posit a phonemic dental /s/; (b) whether d and r are two 
phonemes or variants of one; (c) whether bilabial aspirated con- 
sonants are phonetically stops or fricatives; (d) the status of 
the semi-vowel y; and (e) the status of aspiration. They are 
mentioned here because interestingly enough they also repre- 
sent significant points where dialectal variation is conspicuous. 
One more thing. Both Muslims and Hindus use SCB and 
both have absorbed certain influences of Persian (through this 
of Arabic) which reigned as the language of the court of Bengal 
and India for six hundred years. There are today about 2500 








sta 


/s/ 


con 


Cor 
Als 
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words of Perso-Arabic origin in Bengali, and a few suffixal 
forms like -giri, -dar, used by monolingual speakers of Benga- 
li. In general the Persian sounds have been adapted to the pho- 
nological structure of Bengali. But such adaptation has not 
been so thorough with Muslims as with Hindus. The tradition of 
teaching children how to recite the Holy Qoran correctly is still 
very widely practiced. Also, Muslim names in Bengal are chief- 
ly Arabic names, and many words relating to the practice of Is- 
lam are Arabic. One definite feature of “Muslim phonology” is 
the fact that many Muslim speakers of SCB seem to make con- 
sistent distinctions between s and § and between z andj. It is 
difficult to find absolute minimal pairs, but these four sound 
types in the speech of such Muslims cannot be accepted as con- 
textual variants of two phonemes. In this kind of SCB [s]| and 
[z] occur intervocalically and initially before a vowel. Fre- 
quently occurring examples include asia, samad, zaman, azad, 
aziz. All these words the vast majority of Hindus would pro- 
nounce with [8] and [j]; we may therefore include these two as 
phonemes /s z/ of SCB as spoken by the average educated Mus- 
lim Bengali. 

Specific phonological differences among the dialects of East 
Pakistan may now be taken up. 

(a) NORTHERN BENGALI 

Phonemically it has the same system as SCB. Some od- 
dities in a series of lexical items indicated that in some in- 
stances NB has /r/ for SCB /r/, /1/ for SCB /r/, and in Malda 
/s/ for /3/. In vowels from the point of view of SCB there is a 
confusion between /o-2/ and /z-e/. 





Examples: NB SCB 
/sorile/ /sorire/ (in the body) 
/ peet / /pet/ (belly) 
/boro/ /boro/ (big) 


(b) RAJBANGSI 

Phonemically the same. Examples of deviations from SCB: 
Confusions between /s-th/, /l-n/, /z-j/, /d-dh/, etc. and /o-9/. 
Also occasional elisions of certain consonants. 


/ase/ /athe/ (is) 
/zela/ /jela/ (District) 
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/ randhite/ / ridte/ (to cook) 
/ gorbo/ / gorbhs/ (pregnancy) 


(c) EASTERN BENGALI 

The distinction here is in the very phonemic system. East- 
ern Bengali has no /t/, /th/ or /s/, and no /j/. Instead it has 
/¢/, / ¢h/] (affricates of the ts variety) and /z/. It has alsoa 
glottal stop. The series of voiced aspirated stops does not oc- 
cur. 

Sylhet has fricativized variants for both /k/ and /p/ witha 
greater frequency of occurrence than the non-fricative variant. ? 
They seem also to be in free variation with /kh/, /ph/. Thus 
we have [k], [p], medially, when the fricative counterpart oc- 
curs in the preceding syllable or when it is geminate; [x], [] 
elsewhere. In vowels it has no /x/. Like Chittagong-Noakhali 
it possibly has consonantal /y/ and /w/. 

With reference to SCB pronunciation of words, EB shows 
further confusions between /r-r/, />-0/, /z-e/. Initial /s/, 
and sometimes medial /k/ of certain SCB words, is dropped or 
replaced by /h/. On the other hand, initial SCB /h/ in EB is 
not often pronounced, and medially is replaced by a glottal stop. 
These characteristics in ordinary conversation immediately be- 
tray the identity of the speaker as one belonging to EB. 

Dialects of Faridpur-Jessore-Khulna constitute the link 
between East Pakistan Dialects and SCB. Geographically and 
linguistically they are nearer to Central West Bengal in the or- 
der mentioned. Where SCB has /ch/ Rajbangsi has [s], and 
for /j/ it has [z], but /@/ is not [¢], nor are /s/ and /h/ lacking 
as in other EB dialects. 

(d) SOUTHEASTERN BENGALI 

The phonemic characteristics of EB are here more con- 
spicuously observable. When compared with SCB the devia- 
tions are found to be thorough and complete. There isa 
marked divergence in vocabulary also. At least that is what 
one suspects from the radically different phonetic appearance 
of words. /k/ seems to be spirantized. Grierson notes its 
similarity to the Arabic voiceless velar spirant. The conso- 
nantal /y/ and /w/ are distinctly tense and cannot be compared 
with the non-syllabic front and back vowels, which are rela- 
tively lax. 
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4. Inthis section some examples of the dialectal variants 
of the suffixal morphemes will be given. They are stated in 
terms of the SCB norm. 

(a) NORTHERN BENGALI 

Nominative -e has in the plural /-te/: /thaote/ ‘children’. 
Objective plural suffix -der, -digoke has /-dek/: /takordek/ 
‘servants’. A very unusual form of dative termination is /-gune/: 
/amakgune/ ‘to me’, /bapakgune/ ‘to father’. 

The Locative -e has the alternants /-et, -ot/: /ka&Stet/ ‘in 
wood’, /deSot/ ‘in country’. 

In verbal forms, the participial -e (HB -iya) has the vari- 
ants /-z/, /i/ after a stem-final consonant or vowel respec- 
tively. Ex: /koree/ ‘having done’, /pai/ ‘having received’. 

The personal ending in the lst. per. future is /-im, -am, 
(-mu)/ instead of /-bo/: /bolim/ ‘shall say’, /Jam/ ‘shall go’ 
etc.; cf. SCB /bolbo/, /jabo/. The 3rd per. has no vowel fi- 
nally: /hoil/ ‘it became’, /thil/ ‘it was’; cf. SCB /holo/, 
/chilo/. 

In bordering areas near Malda /-gor/ for genitive plural 
and /-mu/ for lst per. fut. may also be found. 

There are isolated instances of adverbial or loc. -e being 
/-i/: ./samni/ ‘in front’, /asti/ ‘slowly’. 

(b) RAJBANGSI 

Nom. pl.: /-gula, -gila/, gen. pl. -(e)r plus /-ghor/: 
/balokgula/ ‘boys’, /fakorerghor/ ‘of the servants’. 

Obj.: /-k/ as in NB. 

Genitive /-kar/: /balokkar/ ‘of boy’ 

Loc. /-ot, -ot/: /balokot/. 

Verbal forms: 
lst pers. present and perfect /-3/: /ath/ ‘Iam’ 


past /-nu/: /athinu/ ‘I was’ 
future /-m, -mu, -mo/ 
2nd pers. past /-lu, -llu/: /kaillu/ ‘I did’, 
*‘ycu did’ 
future /-bu/. 


(c) EASTERN BENGALI 

First the forms that are generally found in this group are 
mentioned, then a few specific references are made. 
For SCB obj. suffix /-ke/, EB has /-ere, -re/: /bapere/ (to 
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father). For SCB gen. /er/ EB has /or/, gen. -obj. pl. is /-go/: 


/¢akorgo/ ‘to servants’, /tago/ ‘their’. In Mymensing obj. plu- 
ral has /-rare/: /dusrare/ ‘to friends; in the locative /-9, -t/: 
/dilo/ ‘at heart’, /golat/ ‘on neck’. 

Verbal forms: 
Ist per. fut. /-mu/: /komu/ ‘I shall say’. 
In Mymensing/-am/. In the 3rd per. past the endings are 
(Mymensing) /-lo, -1/. 


For SCB participal -e we have /-a/: /koria/ or /koira/ ‘having 
done’. In Tippera it is added after boudling of the final conso- 
nant of the stem: /boilla/ ‘saying’, /uitta/ ‘getting up’. 

The infinitive in Tippera is /-to, -tam/ instead of SCB 
/-te/: /borto/ ‘to fill’. /koitam/ ‘to say’. In Mymensing it is 
/-ot/: /borot/ ‘to fill’. Infinitives for forming inceptive com- 
pounds show even greater contrast when /-bar/ is used for SCB 
/-te/: /aigaibar laiglo/ ‘began to approach’. In Sandip /-on/ 
is used: /karon dilo/ ‘began to do’. 

In Haijong: 

Nom. /-ra, -a/ : /polara koe/ ‘the child says’ 

Obj. /-ge, -ra/ : /polaroge thole/ ‘she placed the child’ 
/takaradi/ ‘give the rupee’. 

Gen. /-lak, -la/ : /rajalak/ ‘of the king’ /ranila/ ‘of the 
queen’ 

Loc. /-mi, -ni, -mini/ : /de3oni/ ‘in the country’. 

As regards verbal forms, Haijong, like all the other dia- 
lects, uses the SCB forms freely but also has “a set of forms 
for the various tenses which do not change for the number or 
person” possibly “due to the influence of Tibeto-Burman Lan- 


guages”, 1° 
Example: 
Past /-bar, -ban/ 
Fut. /-ingo/ 
Infini. /-bake, -bak/ 
Imper. /-marek, -mar/ 

In Sylhet, 
Gen. sing. /-or/ : /goror/ ‘of homes’ 
Loc. /-t, -o/ : /gorot, goro/, ‘in house’ 
Nom. plu. /-ain/ : /gorain/ ‘houses’ 
Past 2nd /-lai/ : /dekhlai/ ‘you saw’ 
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3rd /-la/ 
Fut. lst /-mu, -bam/ 
2nd /-bai/ 
3rd / -ba/ 
Conditional /-tam, -tai, -ta/ (past) 


" /-ain/ (present, 3rd, honorific) 


Progressive present 
lst /-tram, -yar, -ram/ : /zaitram/ etc. 
‘Iam going’. 
2nd /-trai, -tre/ 

Imperative /-un, -uka/ 

The infinitive of purpose or desire has the ‘most notice- 
able construction’ in Sylhet. The ending (SCB /-te/) changes 
for person /-tai, -to, -ta/: 

/tumi zaitai ¢ao/ ‘you wish to go’ 

/he zaito ¢ae/ ‘he wishes to go’ 

/tain zaita ¢ain/ ‘he (hon. ) wishes to go’ 


(d) SOUTHEASTERN BENGALI 


Nom. /-e/ is very common. 
Abl. / -tun/ 
Gen. /-or/ 
Loc. /-ot/ 
Plural /-gun, -un/ : /kturgun/ ‘dogs’ 
Present tense 
lst /-r/ : /khair/ ‘I eat’ 
2nd /-38, -or, -or/ 
3rd /-er, -tan/ 
Past tense 
lst /-(g)yam, -gi/ : /korgyom/ or /kariyam/ 
or /korgi/ ‘I did’ 
2nd /-(g)yo, -1la/ 
3rd /(-g)ys, -1/ 
Future tense 
lst /-(g)yum, -bo, -bam/ 
2nd /-ba/ 
3rd /-be/ 
Imperative /-2/, when negative /-io/ : /no korio/ ‘do not’ 
Honorific /korotak/. 
Participial /-yare/ : /koriyare/ ‘having done’. 


(in Chakma /-inai/ : /jeinai/ ‘having gone’) 
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To take a final example from Chakma: 


Present Past 
Sing. & Plur. Sing. Plur. 
lst /agi/ ‘I am’ /elun/ ‘I was’ /elan/ ‘we were’ 
2nd /agos, ne/ /ele/ /ela/ 
3rd /age, ne/ /el/ / elak/ 


To summarize: the morphological peculiarities of the 
individual dialects of East Pakistan are most striking in three 
features: a) the pronominal stems and their related forms; b) 
case endings of nouns and pronouns; and c) the tense-morphemes 
and personal endings of verbs. 

Example (a) 

“I” is /ami/ in SCB, /mui/ in NB, /&i/ in Noakhali (SE), 
/moe/ in Chakma (EB). “Of me” is /amar/ in SCB, /mor/ in 
NB, /ma-lagik/ in Haizong (EB), /mo/ in Chakma (SE), /ar/ 
in Noakhali. “Of us” is /amader/ in SCB; /hamar/ NB; /amrar/ 
Mymensing (EB); /morgo/ Barisal (EB); /mogo/ Dacca (EB); 
/amalak-beden/ Haijong (EB); /Aarar/ SE; /amarar/ Chadme 
(SE). 

Example (b) 


SCB: /%akorder/ ‘to servants’ 
NB : /&akordek/ 
EB : /¢ahorgo/ 
SE : /¢haorgun/ 

or, SCB: / golae/ ‘on neck’ 
EB : /golat/ 

Example (c) 
SCB: /bolbo/ ‘(I) shall say’ 
NB : / bolim/ 
EB : /bolmu/ 
SE :  /koiyum/ 


In offering a general description of the morphological pecu- 
liarities of these dialects of East Pakistan we should take into 
account the fact that most of these dialects may also use forms 
that are virtually identical with those of SCB, freely in alter- 
nation with forms that do not occur in SCB. 


5. It is clear from the preceding section that for many 
East Bengalis there is aco iderable difference between local 
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dialect andSCB. Asa result, the people of East Bengal, in 
particular the speakers of Eastern and Southeastern Bengali, 
always suffered one disadvantage: they tried hard but never 
could satisfactorily wield this tool of SCB in normal verbal traf- 
fic. It was not the grammar that bothered them most, for the 
rules were simple and few and one more or less got to know 
them along with a general education. But when they grappled 
with it in actual utterance they floundered miserably, and the 
more wary they were the more pitiful became their lapses. 
Girls, who may be more eager in attaining grace and elegance 
both in personal appearance and in social behavior, fared bet- 
ter in the struggle. The men lagged behind. Even when their 
grammar was correct in terms of SCB their pronunciation of 
individual items was often deplorable, their intonations too dia- 
lectal. Socially this had been very compromising. !! 

But the situation today is different. Now SCB can also be 
looked upon as a standard form of Bengali spoken in Indian West 
Bengal. The spirit of antagonism against an invidious distinc- 
tion among dialects is thus reinforced. Literally there are 
fewer people around in East Pakistan to jibe at you for your 
frailties. A sense of release from constraining obligations in 
verbal behavior is noticeable everywhere. Intellectuals are 
found engaged in discussing economics, philosophy, art, poli- 
tics all in a language not definitely the same old SCB. When 
boy meets girl today at the university they freely begin to ex- 
change innuendoes in dialectal forms which a few years ago 
would have acted as a veritable antidote to any embryonic ro- 
mance. Even the broadcasting house is now tolerant of many 
‘talkers’ who satisfy their listeners enormously in spite of many 
deviations from SCB in pronunciation. 

What strikes one most in the situation is the absence of any 
one standardized spoken form which everyone aims to cultivate. 
It is not true that everyone is using his own dialect with a venge- 
ance in ind outside tiie home. The necessities of modern living 
are bringing about an incessant commingling of diverse dialects 
of many tiny groups of people living close together and are also 
creating the hankering after a common convenient standardized 
form. There is a conscious effort to choose forms that are 
common with the listeners’ dialect or that are presumably 
widely understood, an effort to reduce the idiosyncrasies of 
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one’s own dialects, and an attempt to incorporate the easily imi- 
table allied forms of the SCB as well as HB within the frame of 
this naturally emerging standard colloquial Bengali of East Pa- 
kistan. The past tense /hoedilo/ ‘it was’ is not being replaced 
by such forms as /(h)oisilo/ (Dacca), /oisil/ (Noakhali), /oisele/ 
(Bar.) or by HB /hoiathilo/ but by something that strikes a mean 
between SCB and HB: /hoi¢hilo/. Both SCB and HB influence it 
in retaining the initial /h/ but the modification of the affricate 

is characteristic of EP. Similarly, the first person future end- 
ing is commonly /-mu, um/ instead of SCB /-bo/. 

For the gen. -r you may have /-go/; /-re/ instead of SCB 
/-ke/, /-er/. /-e/ may be used oftener in the nominative. Pro- 
nominal /tara/ may take over /ora/ ‘they’. Phonemically the 
chances of remaining very close to SCB are very high except in 
the area of the affricates. The deviations from SCB will be 
still conspicuous particularly in words containing /r, r/ or /9, 
o/. All these portend to one thing: the Dacca-Mymensing group 
of dialects may play the major role in moulding the Standard 
Colloquial of East Pakistan. As the capital city grows, becom- 
ing the center of economic, social, political and cultural activ- 
ity of the whole of East Pakistan, its dialect, too, will gain in 
gravitational force. The SCB based on the dialect of Calcutta 
and its surroundings may be conjectured to have taken shape 
the same way. 

The whole situation in EP is still in such a transitional, 
formative stage that all predictions remain open to question. 
Moreover, its very transitional nature has added more com- 
plexity to the interrelationship among the already existing styles 
of Bengali. The new form in East Pakistan is not well defined 
or codified and as such hardly acts as a substitute for the old 
SCB which is still the form to be adopted in all strictly formal 
circumstances such as the radio, the classroom, the pulpit. 

No EP dialect or any blend of EP dialects as yet can be used in 
writing. Hence, making the burden worse, the educated EP 
speaker has now four instead of three models of Bengali to jug- 
gle with. They may be diagrammatically shown as: 
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Formal Speech and 
in Writing. 











/ai zaiyum/ /ami jamu/ /ami jabo/ /ami jaibo/ ‘I 
shall go’ 
/mui zaibam/ 


/ami zamu/ 


Notes 
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ta, 1938, p. 47. 


4. E.A. Dimock, Literary and colloquial Bengali in modern 
Bengali prose. 


5S. Ge. cit., p. 19. 


6. Note that the territory covered by the linguistic term 
“Eastern Bengali” does not coincide with the political entity 
East Pakistan. 


7. The Bengali dialect situation is similar, on a smaller 
scale, to that of Hindi as characterized by John Gumperz, Lan- 
guage Problems in the Rural Development of North India, The 
Journal of Asian Studies, 1957, Vol. XVI, No. 2. Saag 


8. Cf. S. K. Chatterji, A Brief Sketch of Bengali Phonetics 
BSOS, London, 1921, Vol. II, Part I; C. A. Ferguson and M. 
Chowdhury, The phonemes of Bengali, Language 35:4 (1959). 





9. This comment is based on personal observation; it is 
not clear from Grierson's material. 


10. Grierson, op. cit. 


11. On the other hand the dialects of East Bengal had some 
effect on the forms of SCB. Tagore pointed this out in 1938 
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giving as illustration the fact that the postpositional athe ‘with’ 
had crept into SCB and was beginning to displace songe: 

In verbal forms, the alternants of the first and third per- 
son endings with the past morpheme -1- shifted from -lum, -le 
to -lam, -lo. Possibly, the gradual acceptance of a fully aspi- 
rated pronunciation -th- of the progressive morpheme -é- as 
equally elegant SCB is another instance of this influence. Cer- 
tain features of phrase structure and the use of tara ‘they’ for 
ora may also be cited. 














SOCIAL DIALECTS IN DHARWAR KANNADA 


William McCormack 


University of California, Berkeley 


0. Introductory — background and procedure 
l. Differences, Brahmin : non-Brahmin 


2. Differences, backward classes : other classes 


0.1 In Dharwar, and in the adjoining districts of Belgaum 
and Bijapur, there appear to be three dialects of conversational 
Kannada which correspond to the three main cleavages in the 
social system. The three social classes to which there is a 
corresponding class dialect are the Brahmin, the non-Brahmin, 
and the Harijan. These three social levels are known popularly 
as the ‘advanced’, ‘middle’, and ‘backward’ classes, respec- 
tively. The principal caste represented by the advanced classes 
in the three districts are the Deshastha Brahmins, who belong 
either to the Smarta (Saivite) sect or to the sect of the Madhvas, 
who are primarily worshippers of Vishnu. Deshastha Brahmins 
number about 2.5% of the population of Dharwar, Belgaum and 
Bijapur districts. The total population of these three districts, 
which were called collectively the ‘Bombay Karnatak’ before 
their merger with the new Mysore linguistic province on 1 No- 
vember, 1956, was reported in 1951 to be 4,698,479. The mid- 
dle classes of the three districts are mainly followers of the 
Lingayat, or Virasaiva, religious sect, whose members form 
about 20% of the population in the new Kannada- speaking lin- 
guistic state. Lingayats are concentrated in their greatest 
strength in Dharwar, Belgaum, and Bijapur districts, where 
they account for about 35.5% of the population. The two princi- 
pal castes of the Lingayat sect are everywhere the Jangams, or 
priests, and the Banajigas, or traders. The backward classes, 
whose hereditary occupations are held to be ritually unclean by 
traditional Hinduism, incluze about 18% of the population in the 
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new Mysore state, and these classes are not concentrated in any 
one or few districts. According to the 1931 Indian census, back- 
ward classes (Harijans) form 8.0% of the population of Dharwar, 
Belgaum, and Bijapur. But a significant proportion of the peo- 
ple practicing ‘unclean’ occupations in these three districts are 
Telugu speakers. In 1951 1.3% of the combined population of 
these three districts were returned as Telugu-speaking, and 
their hereditary occupation is, by inference, that of sweeper. 
The principal Kannada- speaking castes which fall into the ‘back- 
ward’ classification are the Madigas and the Holeyas. 


0.2 Social dialects are recognizable in colloquial Kannada 
only, and not in the literary dialect of so-called ‘educated 
speech’. Just as the literary dialect is essentially uniform for 
the three major regional dialects of Kannada, namely, the Dhar- 
war, Bangalore and Mangalore types, so also the spoken graph- 
ic forms are the same for speakers of different social levels in 
the Dharwar region. A Kannada speaker requires about seven 
years of formal schooling before he can understand the literary 
dialect completely, and some practice or additional years of 
formal education may allow him to master it as a spoken style. 
Statistics given by the 1951 census show that 1.25% of the popu- 
lation in Dharwar, Belgaum and Bijapur districts had undergone 
seven years or more of formal education. But even among edu- 
cated people, the spoken graphic forms constitute an ‘artificial’ 
style, which is mainly limited to the stage, to formal oratory, 
and to pedantically-inclined individuals. Section 0.5 below de- 
scribes some of the phonemic features of ‘educated speech’. 


0.3 A preliminary study was made to find out with what 
degree of accuracy a small sample of the population of Dharwar 
district could recognize the social level of a speaker from about 
five minutes of his tape-recorded monologue. A large panel of 
twenty-eight primary school teachers listened to seven speakers 
of different social levels, and a small panel of nine college stu- 
dents listened to twelve speakers. Five entries were to be 
filled in for each speaker: his caste, native place, age, educa- 
tion, and family income. The tapes were edited of content clues 
to the identity of the speaker, and they were made by the stand- 
ard procedure of asking the speaker to imagine family situations 
and to speak to relatives or friends on homely subjects. The 
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caste of the speaker was correctly identified by the panels 40% 
of the times, misidentified 35% of the time, and there was 25% 
no response to the caste question. The three Brahmins of the 
Dharwar region were among the speakers most often correctly 
identified. One of these Brahmins scored 87% correct identifi- 
cation, and as his speech was full of literary borrowings, we 
might infer that his high score was due to his confirming the 
social stereotype that ‘Brahmins speak like books’. The four 
backward class speakers were almost never correctly identified, 
and the reason for this was that they spoke a mixture of Brah- 
min, Lingayat and literary forms which effectively masked their 
own distinctive class dialect. Some urbanized backward class 
speakers regularly use such a mixture when they speak before 
members of other social classes, but it is a fact which most 
Dharwar people do not recognize because they automatically as- 
sume that a backward class speaker will use his own distinctive 
linguistic forms. The best recognition score for a backward 
class speaker was 25%, two scores were 0% and one was 10%. 
There were no 0% scores for other than backward class speakers. 
The best recognition score, 89%, was made by the smaller panel 
on a Lingayat speaker, but there was a content clue in the form 
of a Lingayat personal name which was not edited from this tape 
and this probably accounts for the high score. The small panel 
scored 78% identification of another Lingayat and 56% on a third. 
The large panel scored 48% identification of one Lingayat speak- 
er and 32% on the other Lingayat whom they listened to. 


0.4 This paper will attempt to present some of the mor- 
phological differences in the speech styles of Dharwar Brahmins, 
Lingayats, and members of the backward classes. The compar- 
ison is based on about an hour’s tape-recorded material in which 
Brahmins were speaking in imaginary family situations and on 
forty minutes’ tape-recorded conversation in a backward class 
family’s house when only family members were present. (The 
recording machine was operated by a family member.) A Lin- 
gayat linguistic informant then listened to these two tapes, and 
the forms which he would say differently were noted. About 
220 forms were marked as Brahmin-Lingayat differences, and 
fifty-two forms were marked as backward class — Lingayat 
differences. Lexical differences accounted for about 20% of the 
Brahmin-Lingayat differences, whereas only three lexical 
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differences between backward classes and Lingayats were noted 
from the taped conversation. 


0.5 The possibility of differences in the phonemic structure 
of the three social styles of speaking Kannada in Dharwar is 
complicated by the problem of individual differences in the use 
of loanwords in different degrees of assimilation to the native 
phonemic pattern. The amount of material in the tapes on which 
the morphological comparisons are made in this study is not suf- 
ficiently large to handle this lengthier phonemic problem. Loan- 
words have come into Dharwar Kannada from English, Sanskrit, 
Hindi, Marathi, and probably from other languages as well. In 
general, it appears that loanword phonology in Dharwar is re- 
lated to individual factors like education, intelligence and moti- 
vation to learn languages, and the opportunities given by the 
professional contacts of the speaker to learn languages. For 
example, villagers who raise cotton and so market it at the lo- 
cal cotton-trading centers have extensive contact with Bombay- 
type Hindi, which other villagers and townspeople lack. Anyone 
with close relationship to government affairs, including village 
officers, knows a certain amount of English, Marathi, and 
Hindi. A person living in a town has more contact with lan- 
guages, as a rule, than the villager, and since the languages 
are taught in schools, the educated person generally knows more 
languages. The high rate of educated Brahmins relative to other 
classes, their traditional association with the Sanskrit language, 
their close association with government, their concentration in 
towns and cities, and the fact that intermarriage between 
Marathi and Kannada speaking families is more frequent among 
Brahmins all combine to increase the probability that a loan- 
word will not be assimilated to the native phonemic pattern of a 
Brahmin speaker. But these appear to represent a combination 
of circumstances affecting individuals, as it were indirect con- 
sequences of Brahmin caste membership, in contrast to the 
distinctive grammatical style which all Brahmins use as a di- 
rect consequence of their caste. The phoneme /f/, in contrast 
to the /p/ substitution of uneducated persons, appears in loan- 
word used by Brahmin speakers in the taped conversations 
which were used for the recognition of social level of speaker 
study (Section 0.3). But this seems to be more reasonably 
interpreted as a consequence of the English schooling of the 
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Brahmin speakers and of their contact with English in govern- 
ment service, rather than as a direct consequence of social lev- 
el or caste membership. The /f/, /p/ contrast and similar pho- 
nemic information might be characterized as-part of an ‘English- 
educated speech’ style. It seems doubtful, however, that there 
is a ‘Sanskrit-educated speech’ style, which would exist on the 
phonemic level apart from the (Kannada) ‘educated speech’ style 
mentioned in Section 0.2. This latter style incorporated pho- 
nemic distinctions between aspirate and non-aspirate stops, 
nazalized and non-nazalized vowels, and distinguishes three 
voiceless sibilants (a flat blade-alveolar /s/, narrow groove /s/, 
and retroflex /s/ which has an aspirate release as the second 
member of a consonant cluster when followed by a vowel), all of 
which are of recognized Sanskrit origin. These distinctions are 
not made in Kannada by uneducated persons, nor by those edu- 
cated persons who regard them as ‘artificial’. The Sanskrit 
palatal stop + homorganic nasal is rendered as /dny/ in the loan- 
word ‘knowledge’ by educated persons, in contrast to the col- 
loquial form of the word, /gya:na/. 


1.0 Differences in the phonemic shape of suffixes which oc- 
cur in identical morphemic environments with non-contrastive 
meaning are among the most important differences in the speech 
of Brahmins and non-Brahmins. One of these distinctive Brah- 
min suffixes is socially recognized as marking the isogloss 
which divides Brahmin from non-Brahmin speech. This is the 
verb suffix 3rd person-singular-neuter (i.) /adal/. There are also 
differences in the phonemic shape of verbal bases as Brahmins 
and non-Brahmins use them, and there are differences in the 
use of compound base in one style as against a simple base in 
the other. Kinship nouns of address and greeting suffixes have 
also been included in the discussion of the distinctive features 
of Brahmin colloquial style. 


1.1.1 Differences in the phonemic shape of verbal suffixes 
indicating person-number-gender of the actor are illustrated 
by four examples in this section. There are two contrasting 
sets of person-number suffixes in which gender is distinguished 
in the third person (singular: male, female, neuter; and plural: 
human, non-human). There is a third contrasting set, the im- 
perative, but this set lacks first-person-singular suffix, and 
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gender and number are not distinguished in the third person. In 
the examples, these sets will be marked as (i.), (ii.), and (iii.). 

Two allomorphs of the non-Brahmin suffix 3rd person-singular- 

neuter (i.) are shown, each of which was given as the non-Brah- 

min equivalent for the corresponding Brahmin form /ada/. The 

Brahmin form is listed first in all examples. 


/e/ ; /e/ lst person-singular (ii.) 
/e:vi/ ; /i:vi/ 1st person-plural (i.) 
/o:na/ ; /u:nu/ 1st person-plural (iii.) 


/ada/ ; /ayti/, /a/ 3rd person-singular-neuter (i.) 


1.1.2 Four additional person-number (gender) suffixes for 
which there are Brahmin non-Brahmin differences as in Section 
1.1.1 are listed below. These were discovered when the Brah- 
min forms listed in the preceding section were checked with 
Brahmin informants. The Brahmin suffix is listed first, fol- 
lowed by the corresponding Lingayat form. The more frequent 
allomorph in use by Lingayats of the morpheme 3rd person- 
singular-male (ii.) is /a/. The non-Brahmin allomorph /z/ 
listed here as different from the corresponding Brahmin /a/ is 
found with about 36% of the verbal bases which non- Brahmins 
(Lingayats) use. 


/e:ni/ ; /i:ni/ 1st person-singular (i.)! 

faemsfe sl 2nd person- singular (i.) 

/al ; /e/ 3rd person-singular-male (ii.) 
/a:va/ ; /a:vu/ 3rd person-plural-nonhuman (i.) 


There are fourteen (64%) person-number (gender) suffixes 
for which Brahmin and non-Brahmin forms are the same. Four 


are shared in set (i.), seven in set (ii.) and three are shared in 
set (iii.). 


1.1.3 Relational suffixes may be added to verbal bases in 
order to form conditional particles in Kannada, which have the 
syntactic function of phrase modifiers. Three suffixes of this 
type, in which the Brahmin form differed in phonemic shape 
from the corresponding non-Brahmin form, were found in the 
tape. The pairs of suffixes are as follows, the Brahmin form 
listed first: 
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/o:tle/ ; /ku:dle/ ‘soon after’ 
/likke/ ; /a:ka/ ‘to’ or ‘for’ 
/go:ta/ ; /konta/ Durative action 


1.1.4 There are locative suffixes in Kannada which are 
added to nominal bases to indicate the location of a noun with 
respect to the action of the verb. Two such suffixes may be 
added to the same nominal base, so that there can be base + suf- 
fix, + suffix2, as in /mani.kad.inda/ ‘from the house’. The 
starred forms in the following list indicate Brahmin forms which 
non-Brahmins also use in some contexts, but in at least one con- 
text (the context in which the Brahmin used the suffix in the re- 
corded conversation) the non- Brahmin would employ the second 
form which is listed here. 


*/inda/ ; /e:ga/ ‘from’ 

*/kade/ ; /la:ga/ ‘at’ 
/tuttu/ ; /suttu/ ‘around’ 
/olaga/ ; /a:ga/ ‘in that’ 
/e:ka/ ; /ige/ ‘to’ 
/ukka/ ; /ige/ ‘to’ 


1.2.1 One common difference between Brahmin and non- 
Brahmin colloquial styles is the use by Brahmins of compound 
verbal bases in environments where the other style has a sim- 
ple base. The use of compound bases is felt to be closer to the 
literary norms of written Kannada. Examples of this style dif- 
ference are given below. The Brahmin form is listed first, 
followed by the non-Brahmin equivalent, and then the form with 
a verbal suffix with which the Brahmin form occurs on the tape, 
and finally the non-Brahmin equivalent of this. Period indicates 
morpheme boundary in the examples. 


/ma:d.idd./ ; /ma:d./ /ma:d.idd.i./ ; /ma:d.i/ ‘you (sg) have done’ 
/bay.tir./ ; /bay./ /bay.tir.t.ara/ ; /bay.t.ara/ ‘they will abuse’ 
Ja:g.bit./ ; /a:g/ /a:g.bit.ade/ ; /a:g.e:ti/ ‘it is done’ 

/idd.ir/ ; /ir./ /idd.ir.beku/ ; /ir.beku/ ‘it might be’ 


1.2.2 The phonemic shapes of seven verbal bases which 
occur in the Brahmin tapes are different when non-Brahmins 
use these bases in identical environments. Two pairs, /ku:/; 
/kun/ and /hord/ ; /hon/, ‘sit’ and ‘start out’, respectively, 
have different base forms for Brahmins as compared to non- 
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Brahmins in all environments. The remaining five examples 
have shared base forms which in some environments both groups 
use. In the following list, the shared base form is listed last. 


/.go:./  ; /.go:n/ bound reflexive verb /.gond./ 
/had./ ; /hada./ ‘gave birth’ /hada./ 
faa: 3 Thm:./ > *go’ /ho:g./ 
Jaz:y./ ; /a:./ ‘become’ /azg./ 
/idd./ ; /ad./ ‘be’ /idd./ 


1.3 There are some generally acknowledged differences 
between Brahmins and non-Brahmins with respect to kinship 
nouns and other forms of address and greeting. Half a dozen of 
the examples given in this section occur in the tape, and, as non- 
Brahmins are quite familiar with these differences, the remain- 
ing examples were elicited from a Lingayat informant, and then 
checked with a Brahmin. The Brahmin forms and the non-Brah- 
min equivalents are listed below. Forms are starred to indicate 
that non- Brahmins also use these Brahmin forms, but in differ- 
ent contexts, which are explained in parentheses after the gloss. 
Differences in verbal suffixes for general greeting are listed 
first, and then kinship nouns are given. The Brahmin form is 
listed first. 


*/.0/ ; /.apa/ greeting any man, especially with the verb 
base ‘come’, as in /ba:r.o/ ; /ba:r.apa/ 
(Non-Brahmins use /.o/ in greeting 
intimate friends and inferiors.) 

*/.u/ ; /.be/ greeting any woman, with the verb ‘come’ 
/ba.O/ ; /ba:r.be/ (Non-Brahmins use 
/.u/ in greeting wife or any younger rela- 
tive, friends, and inferiors.) 

*/.u/ ; /.apa/ greeting any man, with ‘come’ /ba.O/ ; 
/ba:r.apa/ (Non-Brahmins use /.u/ in 
greeting younger relative or inferiors. 

*/.a/ ; /.evva/ addressing mother, with ‘come’ /ba:r.a/ 
; /ba.evva/, /ba:r.be/ (Non-Brahmins 
use /.a/ to greet wife only.) 

*/,a/ ; /.ekka/ addressing elder sister (as above) 

*/akka/ ; /ekka/ ‘elder sister’ (Non-Brahmins use /akka/ 
as a kinship term of reference only, 
whereas Brahmins use it both as a term 








/r 


la 


*/ 
*/ 








87 


of reference and as a noun of address.) 

*/akka/ ; /evva/ ‘mother’ (As above for non-Brahmins; 
/akka/ is a noun of address ‘mother’ for 
Brahmins and /avva/ is the term of 
reference among both groups.) 


*/anna/ ; /enna/ ‘elder brother’ (Non-Brahmins use /anna/ 
as term of reference.) 
*/anna/ ; /eppa/ ‘father’ (As above for non-Brahmins; 


/appa/ is the term of reference among 

both Brahmins and non-Brahmins.) 
/mama:vara/ ; /ma:va/ 

‘father-in-law’, ‘mother’s brother’, ‘fa- 

ther’s sister’s husband’, ‘elder sister's 

husband’, ‘brother’s wife’s father’ 
/atti.avara/ ; /atti/ ‘mother-in-law’, ‘father’s sister’, ‘moth- 

er’s brother’s wife’, ‘brother's wife's 


mother’ 

*/adji/ ; /emma/ ‘grandmother’ (Non-Brahmins use /adji/ 
for ‘great-grandmother’.) 

*/muttee/ ; /edja/ ‘grandfather’ (Non-Brahmins use /muttee/ 


for ‘great-grandfather’.) 


1.4 There is a residue, constituting about 10% of the 
marked Brahmin-non-Brahmin differences, which appear to be 
syntactical differences and are still to be analyzed. It is not 
known which of these differences are due to borrowing from lit- 
erary styles and which are genuine differences in the order and 
arrangement of morphemes. One syntactic difference is, how- 
ever, apparent from the Brahmin tapes, and that is the use of 
the relational suffix /anta/ in a much wider range of contexts by 
Brahmins, as compared to non-Brahmins. 


1.5 There is a larger residue, about 40% of the original 
list of forms which were marked as distinctive of Brahmins’ 
style, that was rejected as showing ‘no difference’ after re- 
checking with the informants. Most of these spurious differ- 
ences arose out of literary borrowings by Brahmin speakers. 
As the Lingayat listener was instructed to mark the differences 
between the taped conversation and his own colloquial speech, 
these forms were naturally included in the first list of differ- 
ences. But when they were checked with other Brahmin 
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informants, these literary loans were discovered to have arisen 
from the momentary pedantic inclination of a speaker in the tape 
and were not in colloquial use among Brahmins. A few of the 
spurious differences were due to the mental set of the Lingayat 
listener, who had been instructed to find differences, and upon 
rechecking with him these differences were found to be illusory. 


2.0 The majority of differences between the speech style of 
backward classes in Dharwar and the speech styles of the other 
two social classes appear to be differences in the phonemic 
shape of verbal suffixes which are used in the same morphemic 
environments as their equivalents in the other two speech styles. 
There are also differences in the use of compound verbal bases 
and in the phonemic shape of allomorphs of the same verbal base 
morpheme, as they are used by members of backward classes. 
There are some distinctive forms of address and kinship nouns, 


which only are to be found in the colloquial utterances of Harijans. 


But the lexical inventory, the vocabulary, of the Harijans’ speech 
appears from this study to be practically identical with that of 
other non-Brahmins in Dharwar. The tendency of urbanized 
members of the backward classes to substitute forms which are 
in use among members of other social classes for their own dis- 
tinctive forms was noted above in Section 0.3. This fashion has 
also included some forms which Harijans share with other non- 
Brahmins; especially the distinctive Brahmin 3rd person-singu- 
lar-neuter /.ada/ is favored by these urbanized speakers, rather 
than the non-Brahmin /.ayti/, which most Harijans use. Such 
borrowings as are made by these urban speakers will be clas- 
sified along with the literary loanwords, and they will not be 
considered as part of the colloquial style of backward classes. 


2.1.1 The distinctive person-number (gender) suffixes in 
the tapes of backward class speakers and their equivalents in 
non-Brahmin (Lingayat) style are given in this section. The 
suffix used by backward class speakers is listed first. The set 
to which each suffix may be assigned is noted as in Section 1.1.1 
by the figures in parentheses, (i.), (ii.) and (jiii.). 


/e:na/ ; /i:ni/ 1st person-singular (i.) 
/a:lu/ ; /a:la/ 3rd person-singular-female (i.) 
/ul/ ; /lu/ 3rd person-singular-female (ii.) 








/al 
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lal ; /e/ 3rd person-singular-male (ii.) (64% Lingayat 
bases take /a/ 
rather than /e/; 
ef. 1.1.2.) 

/e:vu/ ; /i:vi/ 1st person-plural (i.) 

/u/ ; /vi/ 1st person-plural (ii.) 


2.1.2 One additional person-number (gender) suffix which 
is distinctive of backward class speech in Dharwar was discov- 
ered when the forms given in Section 2.1.1 were checked with 
Harijan informants. It is listed below, with accompanying 
Lingayat form. 


/ur/ ; /ru/ 3rd person-plural-human (ii.) 


Backward class speakers share, then, fifteen (68%) person- 
number suffixes with the middle class Lingayats, and they share 
ten (45.5%) of these suffixes with Brahmin speakers. The Brah- 
mins and backward class speakers share one common difference 
from Lingayat speakers, as their 3rd person-singular-male (ii.) 
form is /a/. But this is not a major shared difference, since 
among Lingayats /a/, as the more prominent allomorph of 3rd 
person-singular-male (ii.) morpheme, occurs with three times 
as many verbal bases as the variant form /e/. 


2.1.3 There appear from the taped conversations to be 
three frequently used relational suffixes which have distinctive 
phonemic shapes for backward class speakers. The Harijan 
form is listed below first, followed by the Lingayat form, which 
is also the form used by Brahmins in Dharwar. ? 


/kee:sa/* ; /O/ (/O/ is phonemically zero) 
/nandre/ ; /re/ 


/ut/ ; [tal 


2.1.4 There are two examples in the recorded conversa- 
tions of locative noun suffixes, which have a distinctive phone- 
mic shape for backward class speech. One is /i:gi/, which is 
used as /ige/ by Brahmins and Lingayats. The second example 
is /a:g/, which only Harijans attach to personal nouns of ad- 
dress (names) and to kinship nouns. The Brahmin and Lingayat 
equivalents of /a:g/ distinguish two situations, where the back- 
ward class speaker would apply /a:g/ to both. A Lingayat uses 
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/an/ (Brahmin form is /anna/) when the action of the verb is 
directed toward the noun to which the suffix is attached, and he 
uses /ug/ (/aga/ is the Brahmin's form) if it is intended to as- 
sociate the noun with the action of the verb. 


2.2.1 There are a few examples of Lingayat-backward 
class differences with respect to compound verbal bases. Three 
examples from the tape are given below. The backward class 
form is listed first. 


/ma:d.id./ ; /ma:d./ /ma:d.id.e:na/ ; /ma:d.i:ni/ ‘I did’ 

/al.ikk./ ; /.idd./ /he:l.al.ikk.illa/ ; /he:l.idd.illa/ ‘didn’t say’ 

/a:/ ; /a:k.hat./ /hel.a:.tidda.ri/ ; /hel.a:k.hat.tidda.ri/ 
‘were telling, sir’ 


2.2.2 Three examples are listed in this section of Harijan 
and Lingayat verb bases which differ in phonemic shape in 
identical environments. A form of the base which both groups 
share is listed last. 


/.go:/ ; /.gon/ bound reflexive verb /.gond./ 
/gont/ ; /go:t/ ‘know’ / go:t/ 
/a:/ ; /a:k/ ‘become’ /a:k/ 


2.3 Examples of distinctive Harijan greeting forms and 
kinship nouns are given below, followed by their Lingayat 
equivalents. The verbal suffixes of general greeting are listed 
first, and then the differences in kinship nouns. 


/.a:ko/? ; /.apa/ greeting any man 

/.o/ ; /.apa/ greeting any man (The suffix 
/.o/ is also used by Brahmins, 
but with voice-qualifier ‘author- 
ity’, whereas Harijan usage may 
frequently involve the voice- 
qualifier ‘fear, trembling’; 
Lingayats use this suffix in 
greeting intimate friends and 
inferiore.} 


/aliya.deva.ru/ ; /.apa/ addressing son-in-law 
/bi:g.ru/ ; /.apa/ = males /.be/ = females addressing 
affines 


/ekka/, /eppa/, /emma/ etc. ; /akka/, /appa/, /amma/ etc. 


& 
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reference terms. (The backward 
class speaker uses the non-Brah- 
min nouns of address in both re- 
ferring to and addressing rela- 
tives. Lingayats have a separate 
. set of nouns for address and for 
reference.) 


2.4 No syntactical differences were marked by the Lingayat 
informant in about half a dozen repetitions of the tape of back- 
ward class conversation. There may be one or two differences 
in the use of stress and intonation morphemes, but further 
checking and analysis is needed on this point. 


say’ 


2.5 The residue of ‘imaginary’ differences which were 
marked from the backward class tape by the Lingayat listener 
was about 10% of the list of differences. These spurious dif- 
ferences were due to the mental set of the listener to find dif- 
ferences and not to the marking of literary borrowings, which 
were rare in this tape. 


Notes 


1, In Belgaum the Brahmin form is /i:ni/. 


2. Inthe Bijapur style of speaking the /kee:sa/ suffix is 
not limited to the speech of Harijans as it is in Dharwar and 
Belgaum. 


3. Inthe Bijapur District this usage is not limited to back- 
ward classes. 
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0. Introduction 


1. The Social Function of the Standard Stratum and its Relation 
to Local Dialects 


2. Linguistic Variation Within the Standard Stratum 


3. Conclusion 


0. The Hindi regional language area is the part of North 
India, where Hindi serves as the official medium of local gov- 
ernment and administration. This area, which includes the 
present states of Rajasthan, Delhi, Himachal Pradesh, Uttar 
Pradesh, Madhya Pradesh, Bihar and parts of Punjab, is not 
only the largest of the linguistic units on the subcontinent, but 
also stands apart in other respects. In other sections of the 
country regional languages are limited to compact, geograph- 
ically well defined areas, a fact which has made possible the 
recent organization of linguistic states. Furthermore there is 
in. each area a well recognized standard, known by a single 
name, which although often linguistically distinct from local 
dialects, has served as the prestige form for some time. Lit- 
erature in this standard is generally read and accepted by the 
educated, including those who know English well. All this is 
not the case with the Hindi area. Some forms of the modern 
standard language have been current since the early days of 
Muslim rule, but many distinct varieties exist. These varie- 
ties are spoken in geographically widely separated areas, used 
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in different social situations, under differing prestige conditions 
and are called by many different names. 

We find a multitude of conversational styles — apart from 
the local dialects — variously called Hindustani, Khari Boli, 
Dakkhini Hindi, Dakkhini Urdu, Bazar Hindustani, or also Urdu 
and Hindi.' They are spoken not only within the regional lan- 
guage area itself but throughout urban North India in centers as 
far apart as Karachi, Bombay, Lahore, Calcutta, Ahmedabad, 
Hyderabad and many others. The exact spread of all these 
forms of speech has never really been surveyed and there is 
furthermore little agreement on the application of the above 
names. Different individuals may give different names to the 
same variety while others use identical terms to refer to lin- 
guistically distinct varieties. On the literary level two distinct 
styles are usually recognized: Urdu, which takes its learned 
vocabulary largely from Persian and is written in an adaptation 
of the Perso-Arabic script and Hindi, which draws on Sanskrit 
for its loans and employs the Devanagari script. There is how- 
ever much stylistic diversity, even within these two named en- 
tities. One school of Hindi writers has introduced so many 
Sanskrit borrowings that its writings are incomprehensible to a 
speaker of the colloquial language trained only in Urdu. A sec- 
ond group writes in a way which is very close to the colloquial 
and which could become Urdu merely by changing script and 
substituting a few key terms. Other writers have introduced a 
number of local dialect features into their works thereby 
causing difficulties for readers from distant areas.” There is 
furthermore some controversy about the application of the term 
Hindi to the various medieval and post medieval literatures. 
The tendency in recent years has been to use Hindi for all texts 
produced in the present regional language area. But, many ob- 
ject to this practice and continue to employ the old dialect 
names. Literary Urdu is a little more uniform than Hindi but 
writers still vary greatly in the extent to which they borrow 
from Persian. 

The bulk of the linguistic studies so far have concentrated 
on one or another of the above named entities and have as a 
rule treated it as a separate unit with little reference to the 
speech community in which it was used or to related forms of 
the language. We have grammars of Hindi based on writings 
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in the Devanagari script which often include a number of the 
sub-regional dialects.* Analyses of Urdu may deal either with 
the literary style or with one or another of the spoken varieties 
current among the educated groups.* Of the grammars which 
carry the title Hindustani, some cover the style current in the 
Indian Army — others the so called Bazaar Hindustani or a 
somewhat Urduized urban speech style.° Even Grierson’s Lin- 
guistic Survey of India, which contains the most detailed infor- 
mation on speech distribution operates with unit dialects or 
styles. 6 It is moreover not based on direct field work, but 
rather on written information elicited by a somewhat circular 
procedure in which informants were asked to furnish translations 
of a standard text into one or another named speech form as 
known from local sources. These dialects were then again 
treated as separate units. Studies like the above, while useful 
for those whose interest is confined to literature or to the struc- 
ture of a single variety, nevertheless represent rather arbi- 
trary abstractions from actual linguistic behavior, which are of 
little aid in clarifying the bewildering range of varieties of 
Hindi-Urdu found in North India. If we are to understand speech 
distribution, we must describe it in terms of the society in which 
it operates. ’ 

The present study therefore takes as its unit of analysis 
not a particular style, but all forms of the standard language in 
the regional language area as defined in the first paragraph. 
We will disregard the traditional language names and will feel 
free to deal with any standard varieties used in this area, ex- 
cluding however others not found there such as the Urdu of 
Pakistan, or the bazaar Hindustani of Bombay and Calcutta. 
The paper is divided into two parts, part one describes the so- 
cial environment of the standard, its relation to local dialects, 
its development and function within the speech community. Part 
two presents a linguistic analysis of speech variation and its 
relation to other factors in the social system. 


1, The total range of speech varieties found among urban 
and rural, educated and uneducated, high class and low class 
residents of the Hindi regional language area may be described 
in terms of three strata, each corresponding to its own network 
of communication and having different functions in the social 
system. The dialects spoken at the local or village level form 
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part of a chain of mutually intelligible varieties of Indo-Aryan 
stock, extending through all of North India from the Arabian Sea 
in the West to Assam in the East. They vary geographically 
from village to village and socially from caste group to caste 
group. ® 

Superposed above this chain we find the sub-regional dia- 
lects, varieties which avoid the most divergent localisms and 
are understood over somewhat wider areas. They are the na- 
tive speech of the traditional service and trading castes, re- 
siding in small bazaar towns as well as in the larger urban 
centers, who cater to the needs of the rural population. Vil- 
lagers employ these dialects as a second speech style for inter- 
group or inter-village communication. Grierson’s classification 
based on historical similarity sets up for the present Hindi area 
five groups of varieties called ‘languages’ each of which in turn 
contains several dialects. These groupings are: Western Hindi, 
Eastern Hindi, Rajasthani, Bihari and Pahari (further subdivided 
into Eastern, Central and Western Pahari).? Only the first two 
of these groups show a direct relationship to Hindi, others are 
linguistically closer to neighboring regional languages. Further- 
more, as with the local dialects, language distance increases 
with geographical distance, so that there is mutual unintelligi- 
bility among speech forms at opposite extremes of the region. 

Before the advent of the British, in Muslim times, many of 
the sub-regional dialects had great importance as the literary 
media for the then dominant devotional and court poetry. The 
social function of these literatures was quite limited, however, 
and different from that of a modern standard. We have no evi- 
dence for the existence of a single prestige style or range of 
styles which served for all types of writing and formal discourse 
as is the case in modern western societies, but rather there 
was a series of quite distinct styles, each used for a particular 
purpose. Administration was carried on in Persian or Sanskrit, 
Vedic was used for traditionai Brahminical rituals, trading com- 
munities had their own special medium of communication and 
there seem to have been almost as many co-existing different 
literary styles as there were distinct religious cults or literary 
traditions. Each of these literary styles in a way served as the 
symbol or trade mark of a particular group. Compositions 
were intended primarily for oral presentation and were spread 
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beyond the original dialect area through the religious establish- 
ments, migrant preachers, ascetics and performers belonging 
to that group. Thus Braj Bhasha was the literary vehicle for 
the poets of the Krishna cult of which Surdas and Mirabai are 
the best known, Avadhi was the language of Tulsi Das’s Rama- 
yan, and Maithili is famous for the Vaishnavite poetry of Vidya- 
pati. Literary activity of this type now lives on only in oral folk 
tradition and has all but lost its former social importance. The 
medieval literary dialects however still receive a great deal of 
scholarly attention. Moreover some of the better known, e. g. 
Maithili, Avadhi, or even Grierson’s historical constructs, 
Rajasthani and Bihari, continue to be listed among the major 
languages of India. '® 

The third stratum is that of the standard, which we shall 
refer to by the collective term Hindi-Urdu. The varieties of 
this stratum are in turn superposed above the local and sub- 
regional dialects. They are the native speech of only a minority 
of urban residents; for most people they serve as a second or 
third speech style. Modern Hindi-Urdu had its origin in the 
speech of the Gangetic Doab, just north and northeast of Delhi, 
in the districts of Meerut and Moradabad. Soon after the Mus- 
lim conquest, a trade language, based on this speech, became 
accepted as the lingua franca of the courts, army camps and 
trading centers of the new ruling groups. The use of this lan- 
guage for trade purposes spread through much of North India, 
into many districts where the native dialects were quite differ- 
ent, as far as the coastal district of Bengal and Gujerat and 
Maharashtra. Some hundred years later we find records of 
several literary styles bearing various degrees of similarity to 
the spoken trade idiom. Among these are the Khari Boli of 
Amir Khusrau and later of Kabir, the Rekhta of Delhi, the 
Dakini of the Muslim courts of the Bahmani kingdoms in the 
Deccan, where the local population spoke a Dravidian tongue, 
and several others. Throughout this early period these styles 
were similar in function and distribution to the sub-regional 
dialects described above. Compositions were confined to a 
particular form of religious or court poetry, which was not the 
only, but one of many coexisting literary traditions in the area. 
As a matter of fact, during the seventeenth and eighteenth cen- 
tury Khari Boli poetry was largely replaced by Braj Bhasha. 
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The development of the modern standard language did not really 
begin until the second quarter of the nineteenth century, after 
the consolidation of British rule, when literary Urdu replaced 
Persian as the official language for the purpose of local admin- 
istration. After that time a new Urdu prose style developed; the 
language came to be taught in schools and greatly gained in pres- 
tige. Because of its association with Islam, however, the new 
style was not universally accepted. While many Hindus were on 
the whole willing to use Urdu for the purpose of business and 
administration, few accepted it as a literary language. A sec- 
ond prose style was created by discarding the Persian alphabet 
in favor of the Devanagari script and substituting Sanskrit bor- 
rowings for the Persian loan words. The new Hindi for some 
time became the symbol of the rising forces of reform Hinduism 
as represented by the religious centers of Benaras and Allaha- 
bad, the Arya Samaj, and the Brahmo Samaj movements. Reli- 
gious reform and Hindi education went hand and hand and a wide- 
spread system of Hindi vernacular schools and colleges was set 
up paralleling the Urdu school system. By the end of the nine- 
teenth century the two new standards had all but replaced the 
sub-regional dialect literatures throughout the present Hindi 
area. But they did not spread into all regions where the Hindu- 
stani bazaar language was employed. In Gujerat, Bengali and 
Bombay other literary styles were already well established 
while spoken varieties of Hindi continued to function as trade 
idioms with little prestige.'! After independence in 1947, 

Hindi replaced Urdu throughout the present regional language 
area as the sole official medium of administration and education 
and is now learned by all. Urdu literature however continues to 
flourish. 

With the increasing tempo of urbanization and the recent 
improvements in communication, the varieties of the standard 
stratum are beginning to make greater inroads among the local 
dialects. Native speakers of Hindi-Urdu still represent a 
minority, although an ever increasing one. They are found 
primarily among the westernized, urban groups, who have 
moved away from their home region and have lost connection 
with their rural past. For a much larger number of individuals 
the standard continues to function as a second or third speech 
style, used only in certain social situations, e. g. on formal 
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occasions, with individuals of different social background, in the 
office, in school, on the college campus etc. But the number of 
situations which call for the use of the standard is growing and 
sub-regional dialect forms are being more and more confined to 
the family circle. 

The existence of the standard as a common prestige form 
has a profound effect on the region as a whole. It makes it pos- 
sible to overcome local variations in speech and culture. A per- 
son who knows Hindi or Urdu not only has little trouble in making 
himself understood wherever he travels in the area, regardiess 
of local dialect, but he is also certain to find many other indi- 
viduals who read the same literature and in general share his 
intellectual interests. Even among those who barely know it the 
standard has a certain prestige position. We find it used for 
example in public meetings with audiences made up entirely of 
uneducated monolingual villagers. In the traditional Muslim 
‘Milad’ the Maulvi is required to preach in a highly persianized 
Urdu and in certain Hindu meetings of the ‘Kirtan’ type the audi- 
ence expects a very sanskritized form of Hindi. The function of 
the standard on these occasions is primarily ritual since very 
little concrete information is conveyed. 

The relation between the standard and the sub-regional and 
local dialects may be likened to that between Redfield’s great 
and little tradition.'2 The former serves both as a model for 
prestige imitation and as a channel of communication through 
which new concepts are funneled into the other two levels. Lin- 
guistically these innovations are reflected in loan words and 
phonological and morphological borrowings, leading to the re- 
inforcement of those local dialect features which are similar to 
the standard and to the creation of differential rates of linguistic 
change on two sides of a regional language boundary. Tradi- 
tional dialect surveys which tend to concentrate on old or anti- 
quated speech, disregarding newly borrowed features show 
little evidence of the above processes. But if we were to make 
a modern study of every day speech, say at three equidistant 
points, A, B, C, along our North Indian chain of mutually intel- 
ligible local dialects, where Ais separated frum B and C bya 
regional language boundary, we might expect the overall lan- 
guage distance between A and B —— measured in terms of the 
totality of grammatical, lexical and semantic differences —- to 
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be greater than that between B and C. Speakers in the latter two 
locations should share a number of learned and elegant features, 
borrowed from the regional language, which we would not find 

in A. The standard thus acts as a unifying force linguistically as 
well as culturally, a common mold which counteracts diversity 
at the local level and separates the regional language from other 
neighboring districts. 

If we consider the group of standard speakers from the point 
of view of communication, we find that it has a number of char- 
acteristics which set it off from the rest of the population. !? 
Those who know only the dialects still live, so to speak, in rela- 
tively separate isolated and traditionally oriented speech com- 
munities. Their contacts with the outside world — if we leave 
aside the intra-local networks — tend to cluster around neigh- 
boring towns or urban centers, and even when they do visit these 
centers they do not usually come in touch with persons of differ- 
ent regional and social origin. Participation of these dialect 
groups in political life is limited. 

Conditions at the regional language level are different. The 
large Indian cities derive their population from widely divergent 
sources, including speakers of different sub-regional dialects 
as well as persons from other regional language areas. The 
Marwaris of Rajasthan, Sikhs of Punjab and other such commu- 
nities have large colonies in almost every urban center between 
Bombay and Calcutta and these colonies maintain regular con- 
tacts. Recent anthropological research shows that even among 
the laboring classes we find groups which migrate thousands of 
miles in search of work. There is also an increasing group of 
professionals and government employees who shift from city to 
city. These are the people among whom we find the greatest 
percentage of native speakers of the standard. Politically they 
constitute the active and innovating groups of the populatica, 
those among whom new ideas of national independence and reli- 
gious reform arose and took root. Since Hindi and Urdu act as 
symbols of these reform movements, villagers and small town 
residents who aspire to participate in them have gone out of 
their way to learn the new styles. Although there are no statis- 
tics to prove this, it seems reasonable to assume that a high 
proportion of speakers of the standard are literate. Literacy 
provides direct access to newspapers and related means of mass 








100 


communication. Radio and the cinema are available to urban 
literates and non literates. All these media help to lend unity to 
the urban speech stratum and set it apart from the countryside 
which is as yet little affected.'* If we were to compare the com- 
munication density — measured in terms of personal contact 
and mass media — between distant urban centers as Patna and 
Delhi, or Kanpur and Jubbalpur, we might find that it is equal to 
or even greater than that between geographically less distant 
areas, say a village in the Awadhi region around Lucknow and 
another in the Braj area near Aligarh in U. P. There is there- 
fore some justification for speaking of a single urban speech 
community connected by a super-regional network of communi- 
cation which extends from one urban center to another, without 
directly touching the intervening rural areas. 

The average individual living within this community does 
not confine himself to a single style, but has at least some know- 
ledge of all. Literary Hindi and literary Urdu in their extreme 
forms are cultivated by separate social groups and are consid- 
ered separate courses of study. Among the educated there are 
some who read only writings in the Devanagari script; a second 
group knows only the Perso-Arabic alphabet; a third reads both 
and others again are comfortable with neither style and do most 
of their reading in English. Since independence Hindi has be- 
come compulsory in schools but Urdu continues to be used ex- 
tensively in commerce and the Ghazal, the best known form of 
Urdu poetry, is universally popular. If we look at the modern 
realist Hindi writers, we find that they utilize both Sanskrit and 
Persian borrowings. The juxtaposition of the two styles serves 
to express subtle shades of meaning and to lend reality to their 
writings. Similarly on the conversational level the use of Hindi 
and Urdu forms is not simply a matter of birth and education. 
But, just as it is customary for individuals to alternate between 
dialect and standard depending on the social occasion, so when 
using the standard itself the speaker may select from a range of 
alternatives. Hindi and Urdu therefore might best be character- 
ized not in terms of actual speech, but as norms or ideal behav- 
ior in the sociologist’s sense.'* The extent to which a speaker's 
performance in a particular communication situation approxi- 
mates the norm is a function of a combination of factors such as 
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family background, regional origin, education and social attitude 
and the like. 

In our linguistic description we will try to observe the dis- 
tinction between ideal and actual behavior by comparing our texts 
collected by ordinary linguistic field methods with normative 
styles, those features which are regarded as suitable for formal 
communication and serve as models for actual speech. The fact 
that we draw this distinction does not mean that we intend to re- 
vert to an uncritical acceptance of the traditional prescriptive 
grammar, but rather we include only those features which seem 
to us to have a real prestige function. Modern linguists have 
shied away from the study of prescriptive grammar in recent 
years on the basis of their experience with Western languages 
where these grammars too often represent grammatical fictions 
adhered to by certain groups of grammarians and having no real 
currency in the community at large. In India this is not the case. 
Literary styles are learned through many years of oral mimicry 
memorization — both at home and in the class room — and many 
of the phonemic distinctions not part of the common core are 
acquired in this way. 

A detailed study along the above lines would of course re- 
quire extensive field work. Normative styles would have to be 
validated for each region with questionnaire techniques of the 
type used by social psychologists and would then have to be com- 
pared with an adequate sample of language texts collected under 
carefully controlled interview conditions. The present paper is 
a preliminary statement, based on a limited amount of data, 
which might help to define some of the research problems in- 
volved and isolate variables that could be tested in a more de- 
tailed study. Information on normative styles was collected by 
consulting traditional grammars and in consultation with those 
among our informants who could qualify as experts. The main 
body of the texts was collected from seven native informants, 
from different parts of the Hindi regional language area. In- 
formant A is a native speaker of Punjabi who has lived most of 
his life in Delhi. Informant B is from the Meerut district of U. 
P., C is from Lucknow, F and G from the Bhojpuri regional 
dialect area of U. P. Dis an additional speaker of Bhojpuri 
from Bihar and E is a native speaker of Maithili from Bihar. 

All of them are well educated and have had some formal training 
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in one or both of the two literary styles. C and F are Muslims 
trained in Urdu. C has an advanced degree in the language and 
has also studied some Hindi. G and D are teachers of Hindi and 
Sanskrit. A, B, and E are graduate students in other subjects 
with some training in the literary styles. 

Except for G and E each informant was subjected to three 
sets of interview conditions, a) informal conversation with friends, 
b) reading a passage of prose in a formal style containing features 
associated with the literary Hindi under conditions approximating 
those of a lecture to a learned audience, c) reading of a similar 
passage in literary Urdu. Whenever possible the literary pas- 
sage was given in the original script, but for informants who did 
not know the script, passages were put into romanized transcrip- 
tion. 


2. The formal and informal styles of Hindi-Urdu differ in 
phonology, morphology, syntax and lexicon. In this paper we 
confine our discussion to the most important items in the first 
two of these categories. 


2.1 Maximum Phonemic Inventory: 

The table below is a listing of the total range of contrasting 
segmental phonemes occurring in the data. There are some 
speakers whose speech shows all these contrasts. The majority 
however does not have all. Segments not found in all idiolects 
are placed in parenthesis. 


Consonants: 
bilabial apico- apico lamino- dorso- back 
dental post- palatal velar dorso- 
dental velar 
voiceless pp, ph t, th t, th c, ch k, kh (k) 
Stops 
voiced b; bh. d;.dh d, dh j, jh g, gh 
voiceless (f) s (s) (3) (x) h 
Spirants 
voiced Vv (z) y (g) 
Nasals m n (n) ~ (nasalization) 
Laterals 1 


r (trill) (r) (flap) 
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Vowels: 
Front Central Back 
High i u 
I U 
Mid € a fe) 
(E) 
Low ze 3 
a (a) (unrounded) 


Consonants and vowels may occur initially, medially, and 
finally with the exception of /n/ and /r/ which do not occur ini- 
tially. Aspirate stops do not occur as geminates. All other 
consonants except /h/, /x/, /r/, /g/, /8/, /s/ and /n/ appear as 
geminate or double consonants, e. g. /pata/ ‘address’, / patta/ 
‘leaf’. /v/ and /y/ which are traditionally treated as semi- 
vowels are here included among the consonants since they pat- 
tern like consonants. 

The approximate phonetic range represented by each symbol 
is indicated by its position in the chart. Additional information 
is given in the text whenever relevant. /~/ indicates nasaliza- 
tion and is written above the vowel symbol, e. g. /h&/ ‘are’, 
/hee/ ‘is’, Of the vowels, /I/, /E/, /a/, and /U/ are always 
short. The others vary in length depending on the environment 
in which they occur. 


Consonant Clusters 

The table on page 104 lists all initial and final two consonant 
clusters which occur in the Hindi-Urdu texts. Initial clusters 
are shown at the top of the square, final clusters at the bottom. 
An asterisk indicates items occurring in Urdu, a small circle 
those found in Hindi. Distributional details will be discussed in 
section 2.2. 
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2.2 Phonological Sub-Systems. 

There are a number of speakers whose speech shows all the 
contrasts in the maximum inventory; the majority however does 
not have all of them. If we subtract the segments with limited 
distribution from the maximum inventory, we arrive at what we 
may call the common core, representing the minimum number 
of contrasts necessary for a native speaker of Hindi. 

Historically, segments not in the common core did not form 
part of the local dialects from which the modern standards devel- 
oped. They came into the system through the influence of other 
literary languages. /#8/ occurs in loan words from Sanskrit (Sk.), 
Persian (Pe.), Arabic (Ar.), English (En.), and other Western 
European languages, e. g. /Sobha/Sk. ‘beauty’, /3adi/Pe. ‘mar- 
riage’. /f/ and /z/ were introduced through Persian, Arabic, 
English etc., e. g. /foran/Ar. ‘immediately’, /farm/En. ‘farm’, 
/bazar/Pe. ‘market’, /rezar/En. ‘razor’. /k/, /x/, and /g/ 
occur in words of Perso-Arabic sources only, /kerib/Ar. ‘near’, 
/xUda/Pe. ‘god’, /geb/Ar. ‘secret’. /n/ and/s/ are found ex- 
clusively in direct borrowings from Sanskrit, e. g. /karan/Sk. 
‘reason’, /kast/Sk. ‘trouble’. Two additional items occur only 
in English loan words. These are /E/ and /a/, e. g. /cEk/En. 
‘check’, /bal/En. ‘ball’. The position of /r/ is somewhat dif- 
ferent. The original local dialects seem to have had two phonet- 
ic segments [r] and[d] which were in complementary distribu- 
tion, [r] occurring medially between oral vowels and finally, 

[d] occurring elsewhere. English borrowings such as /soda/ 
‘soda water’ however introduced contrasts between these two 
allophones and as a result the distinction became phonemic. 

Although we refer to the above segments as loans it should 
be remembered that most of them have been in common use 
since quite early in the history of the modern standards and by 
now are regularly part of everyday speech. Their occurrence 
and non-occurrence however forms a convenient basis for dis- 
tinguishing sub-systems within the maximum inventory. 

The normative Urdu pronunciation may be defined as the 
pronunciation considered suitable for formal speech making or 
reading of literary Urdu prose. It consists of the common core 
plus the following segments: /f%xzgkr/. Although norma- 
tive Urdu is traditionally taught with reference to the writing 
system it does not incorporate all the distinctions made in the 
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latter. The letters svad, se, and sin, originally Arabic /s @ s/ 
respectively, are all pronounced alike as /s/. zvad, ze, zal and 
zo which in Arabic represented /d z dz/ are al! pronounced /z/. 
te and to, Arabic /t t/ are pronounced /t/. ain and hamzah which 
were Arabic /‘ ?/ respectively are no longer pronounced as 
consonants. 

A glottal stop [?] is occasionally heard in the speech of older 
traditionally trained individuals, e. g. [ba?id] for /baid/ ‘far’ 
(spelled with ain), but this pronunciation seems to carry little 
prestige. The pronunciation of Sanskrit loan words is regularly 
assimilated to the Urdu system Sk. /n/ becomes /n/ and Sk. /s/ 
becomes /8/, e. g. /karan/ ‘because’ /bhaSa/ ‘language’. 

The normative Hindi system, the pronunciation considered 
suitable for formal Hindi, consists of the common core plus: 

/n/, /8/, /r/ and/s/. Traditional pandits also teach a high cen- 
tral vowel pronunciation [i] in the sequence [ri], in words writ- 
ten with the corresponding syllabic r symbol in Sanskrit. But 
this feature seems to have little real prestige function and might 
be considered part of a special pandit style. On the level of 
phonemic distribution, normative Hindi differs from Urdu by the 
use of final short vowels /I/ and /U/ as in /patI/ ‘husband’ and 
/kIntU/ ‘but’. Both styles share final short /a/ as in /na/ ‘not’. 
Whereas in normative Urdu consonants in Sanskrit borrowings 
are generally assimilated, the position of Persian loan segments 
within the normative Hindi system will require some clarifica- 
tion. Traditional grammars treat these sounds as being outside 
the system proper and there are some modern writers of Hindi 
who object to the use of special Devanagari symbols to repre- 
sent them. In practice /k/ /g/ and /x/ are generally assimi- 
lated, /f/ and /z/ however are quite common at least in Western 
U. P. and those who do not pronounce them properly are often 
ridiculed. More detailed sociolinguistic investigation into the 
prestige position of these items should be of great interest. 

The vowel segments /E/ and /a/ as in /cEk/ ‘check’, /bal/ 
‘ball’ are found primarily in the speech of English speaking bi- 
linguals. Their position with respect to the two normative 
systems is uncertain. Quite possibly more detailed study might 
isolate a third, Westernized Hindi-Urdu normative style char- 
acterized by /E,a/ as well as /z, f, 5, r/. 

In addition to the differences in puonemic inventory and 
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distribution, normative Hindi and Urdu show a number of impor- 
tant distinctions in consonant clusters. For Urdu our cluster 
chart lists only a limited number of initial sequences of the 

type consonant plus 1, r or y. Hindi shows many more items in 
this category and in addition has several other cluster types. 
Among the more distinctive Hindi initial clusters we might list 
the following: 


k3 k3oma ‘forgiveness’ 

st stUti ‘praise’ 

sv svorg ‘heaven’ 

Sr Sri ‘Mr.’ (Urdu /8Iri/) 
sn sneh ‘affection’ 

ny nyay ‘justice’ 


The number of final clusters is larger than that of the ini- 
tial ones in both styles. Many of these clusters are shared, but 
each has a few distinctive sequences. Aside from combinations 
involving loan phonemes, the following clusters might be con- 
sidered as distinctively Urdu. a) Combinations of m plus non- 
homorganic stop or spirant: mt, md, ms, mn. b) Sequences 
with final 1; Hindi has only ml in this category, Urdu has bl, tl, 
kl, sl. c) Clusters with final s: Hindi has only ts, Urdu has ps, 
rs, ms. d)h plus consonant, hr, hm: 


mt sImt ‘direction’ 

ms lems *touch’ 

kl §akl ‘shape’ 

hr Seehr ‘city’ (Hindi /Saher/ or 


/ Sahoar/ 


Most important among the distinctive final Hindi clusters 
are sequences of consonant plus vand y. These do not occur 
in Urdu: 


tv mehetv ‘importance’ 

Sv a8v ‘horse’ 

ny any ‘other’ 

ly muly ‘price’ 

ry gerv ‘pride’ 

jy rajy ‘rule’ (Urdu raj) 


ry sury ‘sun’ 
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It is interesting to note that many of the sequences with 
final /y/ are re-sanskritizations of borrowings which in earlier 
Hindi texts appear with simple consonant endings. Thus 
Bharatendu Harischandra, the most important of the early Hindi 
writers, uses /raj/ not /rajy/ ‘rule’.'® 

The two styles further differ with respect to clusters invol- 
ving aspirate stops, which were not listed in the chart. In Urdu 
aspirate stops occur only in final clusters after nasals. Hindi 
has the following types of additional sequences: 


rth earth ‘meaning’ 
rbh gerbh ‘embryo’ 
rkh murkh ‘fool’ 

khy mUkhy ‘ chief’ 


2.3 Observed pronunciations. 

In this section the normative pronunciations are compared 
with field data from India and the formal and informal interview 
texts collected for this study. Differences among informants 
are classified according to the level of structure at which they 
appear, i. e. according to whether they affect the phonemic in- 
ventory, phonemic distribution or whether they are phonetic. 


Differences in Phonemic Inventory. 

The majority of the uneducated illiterate speakers of Hindi 
and the regional dialects ordinarily replace the borrowed seg- 
ments /8/, /f/, /x/, /z/, /g/, and /n/ by /s/, /ph/, /kh/, /j/, 
/g/, and /n/ respectively, and /d/ and /r/ for them are allo- 
phones of the same phoneme. With the educated informantss 
recorded for this study the situation is more complex. All of 
them regularly distinguish between /3/ and/s/. The segment 
/k/ occurs only with informants C and F, both trained in Urdu. 
All others do not have it. The most common reflexes found in 
words with an original /k/ were /x/ before voiceless stop and 
/k/ elsewhere, e. g. /kanun/ for /kanun/ ‘law’, /vext/ for 
/vakt/ ‘time’. 

Informants C and F also regularly observe the contrast 
between /g/ and /g/, e. g. /gUl/ ‘noise’, /gUl/ ‘flower’. E 
uses /g/ in all words of this type, and thus has no phoneme 
/g/. With A and B the position is as follows. There are some 
items in which the pronunciation varies between [g] and([g], 

e. g. [golat]~[ galat] ‘wrong’. Others such as [ganga] 
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‘Ganges’ always have [g].. The phonetic alternation is confined 
to a limited list of items and is not predictable in terms of the 
phonological environment. It is necessary therefore to set up 
/g/ as part of the phonemic inventory of informants A and B, 
although it is marginal. Items like the word for ‘wrong’ above 
are said to have two phonemic forms: we write /galat/ when 
[g] is heard and /galet/ when we hear [g]. 

The segment /x/ regularly occurs with informants A, B, C, 
and F. DandE show variations similar to the /g/ and /g/ 
alternation described above, e. g. [xas]~[khas] ‘special’. /x/ 
is therefore phonemic in their speech. 

Similar alternations between[j]and[z], and[ph] and [f] 
are found with informants B, D, and E, e. g. [cij ]~[ciz] ‘thing’, 
[kaphi]~[kafi] ‘enough’. ‘The distinctions between [j] and[z] 
and [ph] and[f] are therefore phonemic for them. 

The contrast between /n/ and /n/ is regularly found only 
with E, B, A, and D. Informant C varies between [n] and [n] 
in certain words. /n/ is a marginal phoneme in his dialect. F 
never shows [n]. The contrast between /8/ and /s/ was heard 
only with informant E and his speech shows variation between 
[3] and[s] in words with Sk. /s/. 

An interesting feature of the alternations between ‘loan’ and 
‘native’ phonemes is their apparent irregularity in certain loan 
words and their regularity in others. Informant E for example 
regularly has /j/ in /jlla/ ‘district’ and /jerur/ ‘certainly’, 
but alternates between [z] and[j] in other items like /zyada/ 
‘more’. The consistent use of either of these two phonemes in 
any particular items suggests that these items were borrowed 
at a quite early stage directly into the regional dialects. The 
/f£/ and /ph/ alternation for some informants extends also to 
certain words which are Indic in origin and should have / ph/ 
consistently, e. g. /phIr/~/fIr/ ‘again’, /phoel/~/fal/ ‘fruit’. 


Differences in Phonemic Distribution. 

The most important difference in this category is in the 
distribution of vowels. Informants D, E, have contrasts be- 
tween /a/, /i/, /u/ and/a/, /I/, /U/ in final position, e. g. 
/na/ ‘no’ /ka/ ‘of’ /yadI/ ‘if’ /nedi/ ‘river’, /kIntU/ ‘but’ 
/babu/ ‘clerk’. Other informants have final /a/ and /a/ but 
only one high front and high back vowel /i/ and /u/ respective- 
ly in this position. D and E further have the final sequence 
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/ah/ in a number of polysyllabic words including words of both 
Indo-Aryan and Perso-Arabic origin, e. g. /barah/ ‘twelve’, 
/terah/ ‘thirteen’, /jagah/ Per. ‘place’, /tarah/ Ar. ‘kind’. The 
remaining informants seem not to have this final /ah/. Corre- 
spondences vary between /a/ or /ha/ in words of IA origin like 
the numerals and further also include /e/ or /he/ in some of 
the loan words, /bara/~/barha/ ‘twelve’, /jaga/~ /jagha/~ 
/jage/~/jaghe/ ‘place’. 

There are other alternations in phonemic distribution of 
varying degree of importance which cannot be discussed here in 
any detail but which might be worthy of further investigations. 
One might mention the variation between single and double con- 
sonants in items like /budha/~/bUddha/ ‘old’. Similarly of in- 
terest are the alternations between /k3/ and /kkh/, e. g. 

/ dak3In/~ /dakkhIn/ ‘south’, between /v/ and /b/, e. g. /van/~ 
/ban/ ‘forest’, between nasalized and non-nasalized vowels, 

e. g. /it/~ /it/ ‘brick’, and many more. 

Phonetic Differences. 

On the phonetic level the following variations were found. 
With informant A the phoneme /a/ has allophones ranging be- 
tween [+] and[a] before certain retroflex consonants. Other 
informants vary between [a] and [A] in those environments. 
Allophones of /3a/ before /h/ cluster around [z] or [e] for most 
informants. E and D have [A] in this environment. The allo- 
phones of /i/, /I/, /e/, /u/, /U/, and /o/ are comparatively 
high and tense with D, E, and F. They are lower and more lax 
with A, B, and C. 

/>/ and /z/ represent monophthongal low back and front 
vowels in the case of informant A. With B and C the pronunci- 
ation is diphthongal with a low vowel onset followed by a very 
short upward glide [ee] and[s0]. The diphthongal glide with 
a central vowel onset, [ac] and[ao0], is pronounced with in- 
formants D, E, and F. 

/y/ and /v/ are pronounced with relatively strong conso- 
nantal friction by A, B, and C. Informants D, E, and F havea 
great deal more of vowel color. 

The tongue position of the retroflex consonants /t/ and /d/ 
is most retracted and the preceding vowel has most retroflex 
color with informant A. Informants B, and C have less re- 
tracted tongue position and D, E, and F have least retroflection. 
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The progressive loss of phonetic distinctiveness of the dental- 
retroflex contrasts is of interest in view of the fact that the 
dialects of Bihar are closest to Assamese, which does not have 
this contrast. 

Analysis of the pronunciation differences observed so far 
shows that the occurrence and non-occurrence of borrowed seg- 
ments is related both to education or family background and to 
regional origin. The two Moslems C and F who have had formal 
training in Urdu agree in the consistent use of Perso-Arabic 
loan segments. All Hindus and C who has had some Hindi 
training have /n/. With informants from the West /z/, /f/, and 
/x/ are regular parts of the phonemic inventory, whereas with 
D and E from Bihar these contrasts are marginal. This seems 
to reflect the fact that Indo-Islamic influence has been most 
profound in the West and Urdu literature most widely accepted. 
Differences on the phonetic level on the other hand are corre- 
lated primarily with regional origin. Informant A who has 
lived in Delhi most of his life is set off from all others by his 
pronunciation of /a/, /ze/, /2/, /t/ and by forms such as /terha/ 
‘thirteen’, features which seem characteristic of the sub-re- 
gional dialects in the Hindi-Punjabi border area. Similarly D, 
E and F, the two Hindus and the Moslem from Bihar show sim- 
ilarities in the pronunciation of vowels, of /t/ and /d/, /y/ and 
/v/ and of final /h/. All these features are also found in the 
sub-regional dialects of Bihar. 

The data regarding situational variants in pronunciation is 
meager. Informant C uses /n/ only in situations calling for 
normative Hindi. The speech of informant D shows a much 
higher incidence of /z/ and /f/ in conversations with Moslems 
and Westerners; /j/ and /ph/ are more prevalent when he talks 
to his fellow Hindu from Bihar. Both authors have observed 
additional situational variants in India with older Hindus, 
trained in both styles. These individuals use /k/, and /g/ only 
in situations which call for normative Urdu. With other in- 
formants interviewed for this study variations of this type were 
less pronounced. More detailed investigation is needed with 
closer specification and better control of interview conditions. 

There is furthermore little correlation between formal and 
informal interview situations and the pronunciation differences 
discussed so far. Formal-informal differences are more 
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clearly evident in the pronunciation of consonant clusters. Un- 
educated low prestige speech shows the smallest number of 
clusters. In this style only sequences of consonant plus /y/ occur 
initially and finally there are only clusters of nasal plus homor- 
ganic stop or sibilant plus stop. Clusters in common loan words 
are replaced as illustrated in the patterns below: 


/glas/ ‘glass’ / gllas/ 

/skul/ ‘school’ /Iskul/ (East) or /soakul/ (West) 
/kSama/ ‘forgiveness’ /chama/ or /3ama/ 

/xetm/ ‘end’ / xatam/ 

/3ram/ ‘labor’ / Saram/ 

/%arm/ ‘shame’ / $aram/ 

/neagd/ ‘cash’ /naked/ or /nagad/ or /nagd/ 
/rasm/ ‘tradition’ / rasam/ 

/$Ukr/ ‘thanks’ /8$UkUr/ or /3Ukor/ 

/prem/ ‘love’ /pIrem/ or / parem/ 

/rajy/ ‘rule’ / raj/ 


/smeran/ ‘remembrance’ /sUmiran/ 


Educated speakers regularly use initial clusters of the 
types Cr, Cy, Cl, and Cv, except for occasional variations in 
individual words such as that between /plet/ and /palet/ ‘plate’. 
Initial sC sequences however tend to be replaced by VsC in in- 
formal contexts, e. g. /skul/~/Iskul/ ‘school’, /ste3an/~ 
/IsteSan/ ‘station’. In final position similar formal-informal 
alternations are found in /kIsm/~/kIsIm/ ‘kind’. /baks/~ 
/bakes/ ‘box’ /janm/~/janam/ ‘birth’. With a number of 
final clusters, alternations with consonant vowel consonant se.. 
quences are possible in some words but not in others. Thus 
educated speakers may say /baraf/ or /barf/ ‘snow’, but al- 
ways use /sIrf/ ‘only’. Similar examples are: /taxt/ ~ /taxat/ 
‘throne, bench’ versus /saxt/ ‘hard’; /kIik/~/kIIIk/ ‘reed’ 
versus /mUlk/ ‘country’; /bhasm/ /bhasam/ ‘ashes’ versus 
/jIsm/ ‘body’; /muft/~/mUfat/ ‘free’ versus /koft/ ‘disgust’; 
/meakr/~ /mokker/ ‘deceit’ versus /3Ukr/ ‘thanks’. A third 
set of items always shows final clusters: /%abd/ ‘word’, 
/%axs/ ‘person’, /svarg/ ‘heaven’. 

Clusters not shared in both normative styles tend to be 
replaced in the style in which they do not occur. Some com- 
mon instances are: U. /8Iri/ for H. /Sri/, U. /raj/ for H. 
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/rajy/ ‘rule’, U. /krl3an/ for H. /krI8n/ ‘Krishna’, H. /%aher/ 
for U. /Seehr/. 


2.4 Morphological Differences. 

A complete analysis of morphological variants within Hindi- 
Urdu is beyond the scope of this paper. We will note here only 
the most striking differences. 

One major distinction is found with demonstrative pronouns. 
Normative Hindi has four different forms, which are kept dis- 
tinct in spelling and pronunciation: 


/yah/ ‘this’, /ye/ ‘these’ /vah/ ‘that’, /ve/ ‘those’ 


Normative Urdu has only two forms, written yah, vah but pro- 
nounced as below: 


/ye/ ‘this, these’ /vo/ ‘that, those’ 


Informants A and F always use /ye/ and /vo/, C employs the first 
set only in formal Hindi contexts. B, DandE alternate be- 
tween /yah/ and /ye/ and B also changes between /vah/ and 

/vo/ inthe singular. All three keep the plural /ve/ distinct. 

The Hindi normative set is most closely adhered to in formal 
situations. It would also seem that the Urdu system is more 
common in the West, but more information is needed on the de- 
tails of the alternation and on its social function. 

Borrowings from Persian and Sanskrit, in addition to cre- 
ating differences in lexicon, have also brought in a number of 
differences in the inventory of derivational morphemes. Thus 
in Urdu we have: 


-on /nIsbatan/ ‘relatively’ 

-in /rengin/ ‘colorful’ 

-mond /gareazmand/ ‘desirous’ 

-lya /mezaklya/ ‘funny’ 

hem- /hemdoerd/ ‘friend’ (lit. one who shares the pain) 
ba- /bakalda/ ‘according to rules’ 

-nak /xeternak/ ‘dangerous’ 

pUr- /pUrlUtf/ ‘pleasant’ 

dar- /dermlan/ ‘middle’ 


-8Inas /farz8Inas/ ‘conscientious, dutiful’ 
bo4d- /bednam/ ‘notorious’ 
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Some characteristic Hindi affixes are: 


-tah /sambhavetah/ ‘probably’ 

-lu /$raddhalu/ ‘respectful’ 

- 3k /sahayek/ ‘helpful’ 

-akar /golakar/ ‘circular’ 

-Ik /periIvarIk/ ‘domestic, of family’ 
dUr- /dUrdaSa/ ‘bad condition’ 

-ta /kathorta/ ‘brutality’ 

en- /anerth/ ‘useless’ 

sva- /svabhav/ ‘nature’ 

pra- /proakop/ ‘extreme anger’ 

a- / avak/ ‘silent’ 

-man /bUddhIman/ ‘wise’ 

-It /jivIt/ ‘alive’ 

Ud- /Uddhar/ ‘release’ 

-purvek /bhaktIpurvak/ ‘full of piety, faithfully’ 
-tv /mehatv/ ‘responsibility’ 


It is interesting to note that the bulk of the Sanskrit loan 
affixes are freely used only with direct Sanskrit borrowings 
(tatsamas) and the Perso-Arabic loan affixes are used only with 
Perso-Arabic borrowings. Only a few have come to be used 
also with indigenous roots (tadbhavas). Among these are: 


-dar /dukandar/ ‘shopkeeper’, /thekedar/ ‘contractor’ 
be- /bejan/ ‘lifeless’, /bedharak/ ‘fearless’ 

-ana /bUzUrgana/ ‘elderly’, /jogiana/ ‘like a jogi' 
na- /nagevar/ ‘unbearable’, /nasamajh/ ‘unwise’ 
-xana /davaxana/ ‘hospital’, /kuraxana/ ‘garbage pile’ 
on- /anpedh/ ‘illiterate’, /anarth/ ‘meaningless’ 


The first five items in the above list are of Persian origin 
and are used with Persian loans and tadbhavas. The last item 
is from Sanskrit and is used with tatsamas and tadbhavas. We 
have no instance of a Persian loan affix used with tatsamas or 
a Sanskrit loan affix used with Persian loan roots. 

In normative Hindi a number of roots undergo morpho- 
phonemic changes on the analogy of Sanskrit Sandhi, e. g. /ved/ 
‘Veda’, /veedIk/ ‘Vedic’. Traditional prescriptive grammars 
as a rule describe these changes in terms of the Sanskrit sys- 
tem, using the same classification and terminology. In 
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practice the Hindi system is somewhat different from the San- 
skrit —-we may have for example both / rajnitIk/ and / rajnzetIk/ 
‘political’—— but these differences have not as yet been de- 
scribed. 

As with consonant clusters, the speech of the uneducated 
shows the smallest number of derivative compounds. When ar- 
ranged in order of formality the affixes roughly fall into three 
groups. The items in the third set of examples given above and 
few others such as -lya, -bad are in common use. Others in- 
dicate either relatively formal Urdu, e. g. -moand, ba- or rela- 
tively formal Hindi, e. g. -lu, -ok, -akar, -ta. A third group 
of very formal items is productive primarily in writing and 
very rarely in speech, e. g. Urdu -on, pUr- or Hindi -tah, -Ik, 
-purvok-, -tv. In ordinary speech the relatively formal and 
very formal affixes are most usually substituted by phrases. 
Thus /sambhavatah/ ‘probably’ becomes /sambhav hz/ or 
/mUmkIn hee/ or /nIsbatan/ ‘relatively’ becomes /Us ke 
mUkable mé/ etc. 

Among the lexical borrowings from Persian and Sanskrit 
there are also a number of common conjunctions and adverbs 
which have considerable structural importance, and might be 
classed as function words.'? These form another important 
criterion for stylistic distinctions. The listing below gives 
items occurring in normative Hindi, normative Urdu and infor- 
mal conversation. 


Normative Hindi Informal Conversation Normative Urdu 

/yadI/, /agar/ ‘if’ / agar/ / agar/ 

/yadyeapl/ ‘although’ / agar/ / agarcl/, /hal&kI/ 

/tathapI/, /to bhi/ / phir bhi/, /to bhi/ /tab bhi/, /phIr bhi/ 

/phIr bhi/ ‘yet’, ‘nevertheless’ /tab bhi/ 

/evam/, /tatha/, /or/ ‘and’ /or/ /or/ 

/karen/ ‘reason’ /karan/, /vaja se/ /sabab se/, /vaje se/ 
/vaje se/ 

/kIntU/, / parantU/ /magar/, /lekIn/ /magar/, /lekIn/ 


/meager/ ‘but’ 

/sarvda/, /sada/, /nIty/ ‘always’ /sada/, /hameSa/ /sada/, /hameSa/ 
/sehIt/, /sath/ ‘with’ / sath/ / sath/ 

/atah/, /Is lIle/ ‘therefore’ /Is lle/ /cUnancl/, /Is lle/ 
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The items in the normative Hindi and Urdu columns can again 
be arranged into relatively formal and very formal groups. 
Items in the first category are: Urdu /hal&kI/ ‘although’, 
/cUnancl/ ‘therefore’ and Hindi /yadI/ ‘if’ /parantU/ ‘but’. In 
the latter group we find Hindi /tathaplI/ ‘yet’, /evam/, /tatha/ 
‘and’ /kIntU/ ‘but’ /sahIt/ ‘with’, /ateh/ ‘therefore’. It is in- 
teresting to note that none of the function words in the norma- 
tive Urdu column are in the very formal group. A character- 
istic feature of very formal Urdu style is the use of the enclitic 
-e ‘of’ and -o ‘and’ borrowed from Persian, e. g. /sahabe xana/ 
for the more common / ghar ka sahab/ ‘owner of the house’, 
/darde dIl/ for /dIl ka derd/ ‘pain of the heart’, /abo hava/ for 
/heva aur pani/ ‘climate’ (lit. ‘air and water’), /savalo javab/ 
for /saval aur javab/ ‘question and answer’. 


3. Conclusion. 

The standard speech stratum in the Hindi regional language 
area can be described in terms of a single range of styles. 
Within this range we may distinguish a common core, shared 
by all including the uneducated and two, or possibly three, for- 
mal extremes or normative styles which are set off from each 
other by features of phonology, morphology and syntax. In 
actual texts, spoken as well as written, the differences between 
the extremes tend to be blurred somewhat but we may distin- 
guish various degrees of Hindi, Urdu and Westernized coloring. 
The various styles within the standard stratum tend to be 
associated with particular communication roles and the speech 
of any one individual varies accordingly. Some individuals are 
at home in both normative styles, with others the variations 
are more limited. Aside from the Hindi-Urdu differences, we 
find speech differences such as the contrast between /s/ and 
/8/ and the use of certain consonant clusters, which correlate 
with differences in education. Degree of formality and infor- 
mality is signalled by other consonant clusters, by derivative 
suffixes, borrowed enclitics and certain types of function words. 

The exact nature and social function of the above stylistic 
markers varies from region to region. In the West for exam- 
ple the contrast between /z/ and /j/ is a regular part of the 
phonemic inventory of educated speakers; in the East it is 
marginal for many and only persons with special training in 
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Urdu have it regularly. Regional origin is most clearly evi- 
denced on the phonetic level. All speakers from a particular 
area tend to have the same allophones regardless of social back- 
ground. A person from Bihar trained in Urdu might have the 
same phonemic inventory as another Urdu speaker from Luck- 
now, however, in items which are part of the common core, the 
allophonic content of his speech resembles that of other speak- 
ers from the same area who often have different phonemic 
inventories. 


Notes 


1. For a somewhat different and more detailed discussion 
of the general Hindi or Hindustani problem see S. K. Chatterji, 
Indo-Aryan and Hindi (Ahmedabad, 1942). 


2. The variations within Hindi are similar to the differ- 
ences between Colit Bhasha and Sadhu Bhasha in Bengali, de- 
scribed by Dimock and Chowdhury in this volume. 





3. See for example the following: S. H. Kellog, A Gram- 
mar of the Hindi Language e (3rd ed., London, 1938); K. P. Guz Guru, 
Hindi \ Vyakaran aran (Banaras, 1957); A. Sharma, A Basic Grammar 
of Modern Hindi (Delhi, 1958), this is the most recent official 
grammar published by the Government of India, enemas J of 
Education and Scientific Research. 


4. The standard reference work is John T. Platts, A 
Grammar of the Hindustani or Urdu Language (London, 1920). 

















5. See for example A. H. Harley, Colloquial Hindustani 
(London, 1944); M. C. Saihgal, Hindustani Grammar (11th ed., 
Simla, 1945); T. G. Bailey, Teach Yourself Hindustani (London, 
1950). Some of the grammars in our three categories discuss 
both scripts but do not discuss the relations between the styles. 








6. Sir George A. Grierson, Linguistic Survey of India (19 
vols., Calcutta, 1903-28). 


7. The most useful of the traditional treatments is S. K. 
Chatterji, op. cit., but this contains little concrete linguistic 
data. 


8. Cf. J. J. Gumperz, “Language Problems in the Rural 
Development of North India,” Journal of Asian Studies, 16. 
251-9 (1957). For a discussion of similar local dialect chains 
elsewhere; cf. Trevor Hill, “Institutional Linguistics,” Orbis, 
Tome VII, No. 2, 1958, pp. 441-455. See also the discussion 
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of L complex and L simplex in C. F. Hockett, A Course in 
Modern Linguistics (New York, 1955) p. 323 ff. 


9. Sir George A. Grierson, op. cit. 


10. Cf. O. H. K. Spate, India and Pakistan (London, 1954), 
the standard geography of India which lists Bihari and Western 
Hindi along with Gujerati, Bengali, etc. 


ll. S.K. Chatterji, who suggests that Bazaar Hindi be made 
the national language, does not consider these prestige relation- 
ships. 


12. Cf. R. Redfield, Peasant Society and Culture (Chicago, 
1956), p. 70 ff. 


13. For a general discussion of relevant communication 
problems see K. W. Deutsch, Nationalism and Social Commu- 
nication (New York, 1953). 


14. Cf. Deutsch’s concept of mobilized population, ibid., 
p. 99 ff. 


15. Cf. K. Davis, Human Society (New York, 1948), p. 55 

















ff. 


16. Cf. R. C. Shukla, Hindi Sahitya Ka Itihas (Banaras, 
1955), p. 529. 


17. Cf. the change from ph to f in 19th century Bengali de- 
scribed in S. K. Chatterji, Origin and Development of the 
Bengali Language (Calcutta, 1836); Vol.I, p. 442. 


18. Cf. K. P. Guru, op. cit. 











19. For the use of this term see C. C. Fries, The Struc- 
ture of English (New York, 1952). 

















4- 








INDIANA UNIVERSITY PUBLICATIONS IN 
ANTHROPOLOGY AND LINGUISTICS 


. Paul Radin, Winnebago Hero Cycles: A Study in Aboriginal Literature 


(168 pp., 1948) $2.00 


. Paul Radin, The Culture of the Winnebago: As Described by Themselves 


(119 pp., 1949) $2.00 


. Paul Radin, The Origin Myth of the Medicine Rite: Three Versions. The 


Historical Origins of the Medicine Rite (78 pp., 1950) $2.00 


. Harold E. Driver and S. H. Riesenberg, Hoof Rattles and Girls’ Puberty 


Rites in North and South America (33 pp., 1950); and Hilda J. Curry, 
Negative Painted Pottery of Angel Mounds Site and its Distribution in 
the New World (57 pp., 1950) $1.00 


. L. S. Freeland, Language of Sierra Miwok (205 pp., 1951) $3.00 
. Claude Lévi-Strauss, Roman Jakobson, C. F. Voegelin and Thomas A. 


Sebeok, Results of the Conference of Anthropologists and Linguists (86 
PP-, 1953) $1.00 


. Harold E. Driver, John M. Cooper, Paul Kirchhoff, Dorothy Rainier 


Libby, William C. Massey, and Leslie Spier, Indian Tribes of North America 
(Map + 30 pp., 1953) $1.00. Outline maps 11 x 17 inches may be purchased 
separately 


. Charles F. Hockett, A Manual of Phonology (246 pp., 1955) $3.50 
. Eric H. Lenneberg and John M. Roberts, The Language of Experience (33 


PP., 1956) $1.00 


. Charles F. Voegelin and Florence M. Voegelin, Hopi Domains: A Lexical 


Approach to the Problem of Selection (82 pp., 1957) $2.00 


. Sarah C. Gudschinsky, Proto-Popotecan: A Comparative Study of Popolo- 


can and Mixtecan (118 pp., 1959) $2.00 


. George William Grace, The Position of the Polynesian Languages Within 


the Austronesian Language Family (77 pp., + Map, 1959) $2.00 


Orders for Memoirs may be sent to International Journal of American Lin- 
guistics, Department of Anthropology, Indiana University, Bloomington, 
Indiana 











i 





PUBLICATIONS OF THE RESEARCH CENTER 
IN ANTHROPOLOGY, FOLKLORE, AND LINGUISTICS 


. Edward Sapir and Morris Swadesh, Native Accounts of Nootka Ethnog- 
raphy (457 pp., 1955) $5.00 ; ¥ 

. Eeva K. Minn, Studies in Cheremis, Volume 4, Derivation (viii + 99 pp., 
1956) $2.00 

. Robert N. Pehrson, The Bilateral Network of Social Relations in Kénkama 
Lapp District (xiv + 128 pp., 1957) $2.50 

. Leonard Newmark, Structural Grammar of Albanian (vi + 130 pp., 1957) 
$2.50 

. Robert E. Longacre, Proto-Mixtecan (viii + 195 pp., 1957) $3.50 

. Stanley Newman, Zuni Dictionary (vi + 117 pp., 1958) $2.50 

. Robert William Albright, The International Phonetic Alphabet: Its Back- 
grounds and Development (viii + 78 pp., 1958) $2.50 

. Melville Jacobs, Clackamas Chinook Texts, Part 1 (vi + 293 pp., 1958) $4.00 

. Henry and Renée Kahane and Sol Saporta, Development of Verbal Cate- 
gories in Child Language (vi + 65 pp., 1958) $2.50 

. V. Propp, Morphology of the Folktale (x + 134 pp., 1958) $5.00. 

. Melville Jacobs, Clackamas Chinook Texts, Part II (viii + 369 pp., 1959) 
$4.00 

.. Robert B. Lees, The Grammar of English Nominalizations (xxvi + 206 pp., 
1960) $.400 

. Charles A. Ferguson and John J. Gumperz, Eds., Linguistic Diversity in 
South Asia: Studies in Regional, Social and Functional Variation (viii + 
118 pp., 1960) $3.50 


Orders will be received by Thomas A. Sebeok, Director of Publications of the 
Research Center in Anthropology, Folklore, and Linguistics, Indiana Univer- 
sity, Bloomington, Indiana 





